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COMPOSITIONS, KITS, AND METHODS FOR 
IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF 

OVARIAN CANCER 

5 RELATED APPLICATIONS 

The present application claims priority to U.S. provisional patent application 
serial no. 60/191,03 1 filed on March 21, 2000, U.S. provisional patent application serial 
no. 60/207,124, filed on May 25, 2000, U.S. provisional patent application serial no. 
60/21 1,940, filed on June 15, 2000, U.S. provisional patent application serial no. 
10 60/216,820, filed on July 7, 2000, U.S. provisional patent application serial no. 

60/220,661, filed on July 25, 2000, and U.S. provisional patent application serial no. 
60/257,672, filed on December 21, 2000, all of which are expressly incorporated by 
reference. 

1 5 FIELD OF THE INVENTION 

The field of the invention is ovarian cancer, including diagnosis, 
characterization, management, and therapy of ovarian cancer. 

BACKGROUND OF THE INVENTION 
20 Ovarian cancer is responsible for significant morbidity and mortality in 

populations around the world. Ovarian cancer is classified, on the basis of clinical and 
pathological features, in three groups, namely epithelial ovarian cancer (EOC; >90% of 
ovarian cancer in Western countries), germ cell tumors {circa 2-3% of ovarian cancer), 
and stromal ovarian cancer (circa 5% of ovarian cancer; Ozols et aL, 1997, Cancer 
25 Principles and Practice of Oncology, 5th ed., DeVita et aL, Eds. pp. 1502). Relative to 
EOC, germ cell tumors and stromal ovarian cancers are more easily detected and treated 
at an early stage, translating into higherftetter survival rates for patients afflicted with 
these two types of ovarian cancer. 

There are numerous types of ovarian tumors, some of which are benign, and 
30 others of which are malignant. . Treatment (including non-treatment) options and 

predictions of patient outcome depend on accurate classification of the ovarian cancer. 
Ovarian cancers are named according to the type of cells from which the cancer is 
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derived and whether the ovarian cancer is benign or malignant. Recognized histological 
tumor types include, for example, serous, mucinous, endometrioid, and clear cell 
tumors. In addition, ovarian cancers are classified according to recognized grade and 
stage scales. 

5 In grade I, the tumor tissue is well differentiated from normal ovarian tissue. In 

grade II, tumor tissue is moderately well differentiated. In grade III, the tumor tissue is 
poorly differentiated from normal tissue, and this grade correlates with a less favorable 
prognosis than grades I and II. Stage I is generally confined within the capsule 
surrounding one (stage IA) or both (stage IB) ovaries, although in some stage I (i.e. 

1 0 stage IC) cancers, malignant cells may be detected in ascites, in peritoneal rinse fluid, or 
on the surface of the ovaries. Stage II involves extension or metastasis of the tumor 
from one or both ovaries to other pelvic structures. In stage IIA, the tumor extends or 
has metastasized to the uterus, the fallopian tubes, or both. Stage IIB involves extension 
of the tumor to the pelvis. Stage IIC is stage IIA or IIB in which malignant cells may be 

15 detected in ascites, in peritoneal rinse fluid, or on the surface of the ovaries. In stage III, 
the tumor comprises at least one malignant extension to the small bowel or the 
omentum, has formed extrapelvic peritoneal implants of microscopic (stage IIIA) or 
macroscopic (< 2 centimeter diameter, stage IIIB; > 2 centimeter diameter, stage IIIC) 
size, or has metastasized to a retroperitoneal or inguinal lymph node (an alternate 

20 indicator of stage IIIC). In stage IV, distant (Le. non-peritoneal) metastases of the tumor 
can be detected. 

The durations of the various stages of ovarian cancer are not presently known, 
but are believed to be at least about a year each (Richart et al, 1969, Am. */. Obstet. 
Gynecol 105:386). Prognosis declines with increasing stage designation. For example, 

25 5-year survival rates for patients diagnosed with stage I, II, III, and IV ovarian cancer 
are 80%, 57%, 25%, and 8%, respectively. 

Despite being the third most prevalent gynecological cancer, ovarian cancer is 
the leading cause of death among those afflicted with gynecological cancers. The 
disproportionate mortality of ovarian cancer is attributable to a substantial absence of 

30 symptoms among those afflicted with early-stage ovarian cancer and to difficulty 

diagnosing ovarian cancer at an early stage. Patients afflicted with ovarian cancer most 
often present with non-specific complaints, such as abnormal vaginal bleeding, 
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gastrointestinal symptoms, urinary tract symptoms, lower abdominal pain, and 
generalized abdominal distension. These patients rarely present with paraneoplastic 
symptoms or with symptoms which clearly indicate their affliction. Presently, less than 
about 40% of patients afflicted with ovarian cancer present with stage I or stage II. 
5 Management of ovarian cancer would be significantly enhanced if the disease could be 
detected at an earlier stage, when treatments are much more generally efficacious. 

Ovarian cancer may be diagnosed, in part, by collecting a routine medical history 
from a patient and by performing physical examination, x-ray examination, and 
chemical and hematological studies on the patient. Hematological tests which may be 

10 indicative of ovarian cancer in a patient include analyses of serum levels of proteins 
designated CA125 and DF3 and plasma levels of lysophosphatidic acid (LP A). 
Palpation of the ovaries and ultrasound techniques (particularly including endovaginal 
ultrasound and color Doppler flow ultrasound techniques) can aid detection of ovarian 
tumors and differentiation of ovarian cancer from benign ovarian cysts. However, a 

1 5 definitive diagnosis of ovarian cancer typically requires performing exploratory 
laparotomy of the patient. 

Potential tests for the detection of ovarian cancer (e.g., screening, reflex or 
monitoring) may be characterized by a number of factors. The "sensitivity" of an assay 
refers to the probability that the test will yield a positive result in an individual afflicted 

20 with ovarian cancer. The "specificity" of an assay refers to the probability that the test 
will yield a negative result in an individual not afflicted with ovarian cancer. The 
"positive predictive value" (PPV) of an assay is the ratio of true positive results (i.e. 
positive assay results for patients afflicted with ovarian cancer) to all positive results 
(te. positive assay results for patients afflicted with ovarian cancer + positive assay 

25 results for patients not afflicted with ovarian cancer). It has been estimated that in order 
for an assay to be an appropriate population-wide screening tool for ovarian cancer the 
assay must have a PPV of at least about 10% (Rosenthal et ai, 1998, Sem. Oncol 
25:3 15-325). It would thus be desirable for a screening assay for detecting ovarian 
cancer in patients to have a high sensitivity and a high PPV. Monitoring and reflex tests 

30 would also require appropriate specifications. 
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Owing to the cost, limited sensitivity, and limited specificity of known methods 
of detecting ovarian cancer, screening is not presently performed for the general 
population. In addition, the need to perform laparotomy in order to diagnose ovarian 
cancer in patients who screen positive for indications of ovarian cancer limits the 
5 desirability of population-wide screening, such that a PPV even greater than 10% would 
be desirable. 

Prior use of serum CA125 level as a diagnostic marker for ovarian cancer 
indicated that this method exhibited insufficient specificity for use as a general 
screening method. Use of a refined algorithm for interpreting CA125 levels in serial 

1 0 retrospective samples obtained from patients improved the specificity of the method 
without shifting detection of ovarian cancer to an earlier stage (Skakes, 1995, Cancer 
76:2004). Screening for LPA to detect gynecological cancers including ovarian cancer 
exhibited a sensitivity of about 96% and a specificity of about 89%. However, CA125- 
based screening methods and LPA-based screening methods are hampered by the 

1 5 presence of CA1 25 and LPA, respectively, in the serum of patients afflicted with 

conditions other than ovarian cancer. For example, serum CA125 levels are known to 
be associated with menstruation, pregnancy, gastrointestinal and hepatic conditions such 
as colitis and cirrhosis, pericarditis, renal disease, and various non-ovarian malignancies. 
Serum LPA is known, for example, to be affected by the presence of non-ovarian 

20 gynecological malignancies. A screening method having a greater specificity for 

ovarian cancer than the current screening methods for CA125 and LPA could provide a 
population- wide screening for early stage ovarian cancer. 

Presently greater than about 60% of ovarian cancers diagnosed in patients are 
stage III or stage IV cancers. Treatment at these stages is largely limited to 

25 cytoreductive surgery (when feasible) and chemotherapy, both of which aim to slow the 
spread and development of metastasized tumor. Substantially all late stage ovarian 
cancer patients currently undergo combination chemotherapy as primary treatment, 
usually a combination of a platinum compound and a taxane. Median survival for 
responding patients is about one year. Combination chemotherapy involving agents 

30 such as doxorubicin, cyclophosphamide, cisplatin, hexamethylmelamine, paclitaxel, and 
methotrexate may improve survival rates in these groups, relative to single-agent 
therapies. Various recently-developed chemotherapeutic agents and treatment regimens 
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have also demonstrated usefulness for treatment of advanced ovarian cancer. For 
example, use of the topoisomerase I inhibitor topectan, use of amifostine to minimize 
chemotherapeutic side effects, and use of intraperitoneal chemotherapy for patients 
having peritoneally implanted tumors have demonstrated at least limited utility. 
5 Presently, however, the 5 -year survival rate for patients afflicted with stage III ovarian 
cancer is 25%, and the survival rate for patients afflicted with stage IV ovarian cancer is 
8%. 

In summary, the earlier ovarian cancer is detected, the aggressiveness of 
therapeutic intervention and the side effects associated with therapeutic intervention are 
10 minimized. More importantly, the earlier the cancer is detected, the survival rate and 
quality of life of ovarian cancer patients is enhanced. Thus, a pressing need exists for 
methods of detecting ovarian cancer as early as possible. There also exists a need for 
methods of detecting recurrence of ovarian cancer as well as methods for predicting and 
monitoring the efficacy of treatment. The present invention satisfies these needs. 

15 

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with ovarian cancer as well as 
methods of assessing whether a patient is afflicted with ovarian cancer. This method 
comprises the step of comparing the level of expression of a marker in a patient , sample, 

20 wherein the marker is listed in Tables 1 -2, and the normal level of expression of the 
marker in a control, e.g., a sample from a patient without ovarian cancer. A significant 
difference between the level of expression of the marker in the patient sample and the 
normal level is an indication that the patient is afflicted with ovarian cancer. Preferably, 
a protein corresponding to the marker is a secreted protein. Alternatively, the marker 

25 can correspond to a protein having an extracellular portion, to one which is normally 
expressed in ovarian tissue at a detectable level, or both. 

In one method, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. Also preferred are embodiments of 
the method wherein the marker is over- or under-expressed by at least two-fold in at 

30 least about 20% of stage I ovarian cancer patients, stage II ovarian cancer patients, stage 
III ovarian cancer patients, stage IV ovarian cancer patients, grade I ovarian cancer 
patients, grade II ovarian cancer patients, grade III ovarian cancer patients, epithelial 
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ovarian cancer patients, stromal ovarian cancer patients, germ cell ovarian cancer 
patients, malignant ovarian cancer patients, benign ovarian patients, serous neoplasm 
ovarian cancer patients, mucinous neoplasm ovarian cancer patients, endometrioid 
neoplasm ovarian cancer patients and/or clear cell neoplasm ovarian cancer patients. 
5 In one embodiment of the methods of the present invention, the patient sample is 

an ovary-associated body fluid. Such fluids include, for example, blood fluids, lymph, 
ascitic fluids, gynecological fluids, cystic fluids, urine, and fluids collected by peritoneal 
rinsing. In another embodiment, the sample comprises cells obtained from the patient. 
In this embodiment, the cells may be found in a fluid selected from the group consisting 

10 of a fluid collected by peritoneal rinsing, a fluid collected by uterine rinsing, a uterine 
fluid, a uterine exudate, a pleural fluid, and an ovarian exudate. In another embodiment, 
the patient sample is in vivo. 

In accordance with the methods of the present invention, the level of expression 
of the marker in a sample can be assessed, for example, by detecting the presence in the 

1 5 sample of : 

• a protein corresponding to the marker or fragment of the protein {e.g. 
using a reagent, such as an antibody, an antibody derivative, or an antibody 
fragment, which binds specifically with the protein) 

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment 
20 thereof, having at least a portion with which the marker is substantially 

homologous (e.g. by contacting a mixture of transcribed polynucleotides 
obtained from the sample with a substrate having one or more of the markers 
listed in Tables 1-2 fixed thereto at selected positions) 

• a transcribed polynucleotide or fragment thereof, wherein the 

25 polynucleotide anneals with the marker under stringent hybridization 

conditions. 

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a 
protein corresponding to the marker 

The methods of the present invention are particularly useful for patients with an 
30 identified pelvic mass or symptoms associated with ovarian cancer. The methods of the 
present invention can also be of particular use with patients having an enhanced risk of 
developing ovarian cancer (e.g., patients having a familial history of ovarian cancer, 



WO 01/070979 



PCT/US01/09126 



-7- 

patients identified as having a mutant oncogene, and patients at least about 50 years of 
age). The methods of the present invention may further be of particular use in. 
monitoring the efficacy of treatment of an ovarian cancer patient (e.g. the efficacy of 
chemotherapy). 

5 The methods of the present invention may be performed using a plurality (e.g. 2, 

3, 5, or 10 or more) of markers. According to a method involving a plurality of markers, 
the level of expression in the sample of each of a plurality of markers independently 
selected from the markers listed in Tables 1-2 is compared with the normal level of 
expression of each of the plurality of markers in samples of the same type obtained from 

0 control humans not afflicted with ovarian cancer. The markers of Tables 1-2 may also 
be used in combination with known ovarian cancer markers in the methods of the 
present invention. 

In a preferred method of assessing whether a patient is afflicted with ovarian 
cancer (e.g., new detection ("screening"), detection of recurrence, reflex testing), the 
5 method comprises comparing: 

a) the level of expression of a marker in a patient sample, wherein at least 
one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control non-ovarian 
cancer sample. 

0 A significant difference between the level of expression of the marker in the patient 
sample and the normal level is an indication that the patient is afflicted with ovarian 
cancer. 

The methods of the present invention further include a method of assessing the 
efficacy of a test compound for inhibiting ovarian cancer in a patient This method 
5 comprises comparing: 

a) expression of a marker in a first sample obtained from the patient and 
maintained in the presence of the test compound, wherein the marker is 
selected from the group consisting of the markers listed in Tables 1-2, 
and 

0 b) expression of the marker in a second sample obtained from the patient 

and maintained in the absence of the test compound. 
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A significant difference between the level of expression of the marker in the first 
sample, relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting ovarian cancer in the patient. For example, the first and 
second samples can be portions of a single sample obtained from the patient or portions 
5 of pooled samples obtained from the patient. 

The invention further relates to a method of assessing the efficacy of a therapy 
for inhibiting ovarian cancer in a patient. This method comprises comparing: 

a) expression of a marker in a first sample obtained from the patient prior 
to providing at least a portion of the therapy to the patient, wherein the 

10 marker is selected from the group consisting of the markers listed in 

Tables 1-2, and 

b) expression of the marker in a second sample obtained from the patient 
following provision of the portion of the therapy. 

A significant difference between the level of expression of the marker in the second 
1 5 sample, relative to the first sample, is an indication that the therapy is efficacious for 
inhibiting ovarian cancer in the patient. 

It will be appreciated that in these methods the "therapy" may be any traditional 
therapy for treating ovarian cancer including, but not limited to, chemotherapy, radiation 
therapy and surgical removal of tissue, e.g., an ovarian tumor. Thus, the methods of the 
20 invention may be used to evaluate a patient before, during and after thereapy, for 
example, to evaluate the reduction in tumor burden. 

The present invention therefore further comprises a method for monitoring the 
progression of ovarian cancer in a patient, the method comprising: 

a) detecting in a patient sample at a first time point, the expression of a marker, 
25 wherein the marker is selected from the group consisting of the markers listed in Tables 

1-2; 

b) repeating step a) at a subsequent time point in time; and 

c) comparing the level of expression detected in steps a) and b), and therefrom 
monitoring the progression of ovarian cancer in the patient. 

30 The invention also includes a method of selecting a composition for inhibiting 

ovarian cancer in a patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 



WO 01/070979 



PCT7US01/09126 



-9- 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

5 d) selecting one of the test compositions which alters the level of 

expression of the marker in the aliquot containing that test composition, 
relative to other test compositions. 
In addition, the invention includes a method of inhibiting ovarian cancer in a 
patient. This method comprises the steps of: 
0 a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

5 d) administering to the patient at least one of the test compositions which 

alters the level of expression of the marker in the aliquot containing that 
test composition, relative to other test compositions. 
The invention also includes a kit for assessing whether a patient is.afQicted with 
ovarian cancer. This kit comprises reagents for assessing expression of a marker listed 
0 in Tables 1-2. 

In another aspect, the invention relates to a kit for assessing the suitability of 
each of a plurality of compounds for inhibiting an ovarian cancer in a patient. The kit 
comprises a reagent for assessing expression of a marker listed in Tables 1-2, and may 
also comprise a plurality of compounds. 

5 In another aspect, the invention relates to a kit for assessing the presence of 

ovarian cancer cells. This kit comprises an antibody, wherein the antibody binds 
specifically with a protein corresponding to a marker listed in Tables 1-2. The kit may 
also comprise a plurality of antibodies, wherein the plurality binds specifically with a 
protein corresponding to a different marker listed in Tables 1-2. 

) The invention also includes a kit for assessing the presence of ovarian cancer 

cells, wherein the kit comprises a nucleic acid probe. The probe binds specifically with 
a transcribed polynucleotide corresponding to a marker listed in Tables 1-2. The kit 
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may also comprise a plurality of probes, wherein each of the probes binds specifically 
with a transcribed polynucleotide corresponding to a different marker listed in Tables 1- 
2. 

The invention further relates to a method of making an isolated hybridoma which 
5 produces an antibody useful for assessing whether a patient is afflicted with ovarian 
cancer. The method comprises isolating a protein corresponding to a marker listed in 
Tables 1-2, immunizing a mammal using the isolated protein, isolating splenocytes from 
the immunized mammal, fusing the isolated splenocytes with an immortalized cell line 
to form hybridomas, and screening individual hybridomas for production of an antibody 
10 which specifically binds with the protein to isolate the hybridoma. The invention also 
includes an antibody produced by this method. 

The invention further includes a method of assessing the ovarian carcinogenic 
potential of a test compound. This method comprises the steps of: 

a) maintaining separate aliquots of ovarian cells in the presence and 
1 5 absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots. 
The marker is selected from those listed in Tables 1-2. A significantly altered 
level of expression of the marker in the aliquot maintained in the presence of (or 
exposed to) the test compound, relative to the aliquot maintained in the absence of the 
20 test compound, is an indication that the test compound possesses ovarian carcinogenic 
potential. 

Additionally, the invention includes a kit for assessing the ovarian carcinogenic 
potential of a test compound. The kit comprises ovarian cells and a reagent for 
assessing expression of a marker in each of the aliquots. The marker is selected from 

25 those listed in Tables 1-2. 

The invention further relates to a method of treating a patient afflicted with 
ovarian cancer or at risk of developing ovarian cancer. This method comprises 
enhancing expression of a marker listed in Tables 1-2 or providing to cells of the patient 
a protein corresponding to a marker listed in Tables 1-2, wherein the marker is 

30 underexpressed in patients afflicted with ovarian cancer. The protein can be provided to 
the cells, for example, by providing a vector comprising a polynucleotide encoding the 
protein to the cells. 
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The invention includes another method of treating a patient afflicted with ovarian 
cancer or at risk of developing ovarian cancer. This method comprises inhibiting 
expression or overexpression of a marker listed in Tables 1-2 by, e.g., providing to cells 
of the patient an antisense oligonucleotide complementary to a polynucleotide 
5 corresponding to a marker listed in Tables 1 -2, wherein the marker is overexpressed in 
patients afflicted with ovarian cancer. 

It will be appreciated that the methods and kits of the present invention may also 
include known cancer markers including known ovarian cancer markers. It will further 
be appreciated that the methods and kits may be used to identify cancers other than 
10 ovarian cancer. 

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered genes associated with the cancerous 
state of ovarian cells. It has been discovered that the level of expression of individual 
15 genes, also referred to as markers, and combinations of these genes, correlates with the 
presence of ovarian cancer in a patient. Methods are provided for detecting the presence 
of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an 
ovarian cancer, and with other characteristics of ovarian cancer that are relevant to 
prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient. 

20 

Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an 11 are used herein to refer to one or to more than one (i.e. 
25 to at least one) of the grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 

A "marker" is a naturally-occurring polymer corresponding to at least one of the 
novel nucleic acids listed in Tables 1-2. For example, markers include, without 
limitation, sense and anti-sense strands of genomic DNA {i.e. including any introns 
30 occurring therein), RNA generated by transcription of genomic DNA (i. e. prior to 
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and 
proteins generated by translation of spliced RNA (i.e. including proteins both before and 
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after cleavage of normally cleaved regions such as transmembrane signal sequences). 
As used herein, "marker" may also include a cDNA made by reverse transcription of an 
RNA generated by transcription of genomic DNA (including spliced RNA). 

The term "probe" refers to any molecule which is capable of selectively binding 
5 to a specifically intended target molecule, for example a marker of the invention. 
Probes can be either synthesized by one skilled in the art, or derived from appropriate 
biological preparations. For purposes of detection of the target molecule, probes may be 
specifically designed to be labeled, as described herein. Examples of molecules that can 
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, 
10 and organic monomers. 

An "ovary-associated" body fluid is a fluid which, when in the body of a patient, 
contacts or passes through ovarian cells or into which cells or proteins shed from 
ovarian cells e.g. ovarian epithelium, are capable of passing. Exemplary ovary- 
associated body fluids include blood fluids, lymph, ascites, gynecological fluids, cystic 
1 5 fluid, urine, and fluids collected by peritoneal rinsing. 

The "normal" level of expression of a marker is the level of expression of the 
marker in ovarian cells of a patient, e.g. a human, not afflicted with ovarian cancer. 

"Over-expression" and "under-expression" of a marker refer to expression of the 
marker of a patient at a greater or lesser level, respectively, than normal level of 
20 expression of the marker (e.g. at least two-fold greater or lesser level). 

As used herein, the term "promoter/regulatory sequence' 1 means a nucleic acid 
sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
promoter sequence and in other instances, this sequence may also include an enhancer 
25 sequence and other regulatory elements which are required for expression of the gene 
product. The promoter/regulatory sequence may, for example, be one which expresses 
the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
30 product to be produced in a living human cell under most or all physiological conditions 
of the cell. 
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An "inducible" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 
5 A "tissue-specific" promoter is a nucleotide sequence which, when operably 

linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only if the cell is a cell of the 
tissue type corresponding to the promoter. 

A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an 

1 0 analog of one of an RNA or cDNA) which is complementary to or homologous with all 
or a portion of a mature RNA made by transcription of a genomic DNA corresponding 
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if 
any, of the transcript. 

"Complementary" refers to the broad concept of sequence complementarity 

15 between regions of two nucleic acid strands or between two regions of the same nucleic 
acid strand. It is known that an adenine residue of a first nucleic acid region is capable 
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic i. 
acid region which is antiparallel to the first region if the residue is thymine or uracil. 
Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of - 

20 base pairing with a residue of a second nucleic acid strand which is antiparallel to the 
first strand if the residue is guanine. A first region of a nucleic acid is complementary to 
a second region of the same or a different nucleic acid if, when the two regions are 
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is 
capable of base pairing with a residue of the second region. Preferably, the first region 

25 comprises a first portion and the second region comprises a second portion, whereby, 
when the first and second portions are arranged in an antiparallel fashion, at least about 
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the 
nucleotide residues of the first portion are capable of base pairing with nucleotide 
residues in the second portion. More preferably, all nucleotide residues of the first 

30 portion are capable of base pairing with nucleotide residues in the second portion. 
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"Homologous" as used herein, refers to nucleotide sequence similarity between 
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by the 
same nucleotide residue, then the regions are homologous at that position. A first region 
5 is homologous to a second region if at least one nucleotide residue position of each 
region is occupied by the same residue. Homology between two regions is expressed in 
terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence S'-ATTGCC-S' and a region having the nucleotide sequence 5'- 

10 TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 

15 the same nucleotide residue. 

A marker is "fixed" to a substrate if it is covalently or non-covalently associated 
with the substrate such the substrate can be rinsed with a fluid (e.g. standard saline 
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the 
substrate. 

20 As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA 

or DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a 
natural protein). 

Expression of a marker in a patient is "significantly" higher or lower than the 
normal level of expression of a marker if the level of expression of the marker is greater 

25 or less, respectively, than the normal level by an amount greater than the standard error 
of the assay employed to assess expression, and preferably at least twice, and more 
preferably three, four, five or ten times that amount. Alternately, expression of the 
marker in the patient can be considered "significantly" higher or lower than the normal 
level of expression if the level of expression is at least about two, and preferably at least 

30 about three, four, or five times, higher or lower, respectively, than the normal level of 
expression of the marker. 
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Ovarian cancer is "inhibited" if at least one symptom of the cancer is alleviated, 
terminated, slowed, or prevented. As used herein, ovarian cancer is also "inhibited" if 
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented. 

A kit is any manufacture (e.g. a package or container) comprising at least one 
reagent, e.g. a probe, for specifically detecting a marker of the invention, the 
manufacture being promoted, distributed, or sold as a unit for performing the methods of 
the present invention. 

Description 

The present invention is based, in part, on identification of novel markers which 
are over-expressed in ovarian cancer cells as compared to their expression in normal (i.e. 
non- cancerous) ovarian cells. The markers of the invention correspond to DNA, RNA, 
and polypeptide molecules which can be detected in one or both of normal and 
cancerous ovarian cells. The enhanced expression of one or more of these markers in 
ovarian cells is herein correlated with the cancerous state of the tissue. The invention 
thus includes compositions, kits, and methods for assessing the cancerous state of 
ovarian cells (e.g. cells obtained from a human, cultured human cells, archived or 
preserved human cells and in vivo cells). 

The compositions, kits, and methods of the invention have the following uses, 
among others: 

1) assessing whether a patient is afflicted with ovarian cancer; 

2) assessing the stage of ovarian cancer in a human patient; 

3) assessing the grade of ovarian cancer in a patient; 

4) assessing the benign or malignant nature of ovarian cancer in a patient; 

5) assessing the histological type of neoplasm (e.g. serous, mucinous, 



endometroid, or clear cell neoplasm) associated with ovarian 
cancer in a patient; 



making an isolated hybridoma which produces an antibody useful for 
assessing whether a patient is afflicted with ovarian cancer; 



7) 
8) 



assessing the presence of ovarian cancer cells; 



assessing the efficacy of one or more test compounds for inhibiting 
ovarian cancer in a patient; 
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9) assessing the efficacy of a therapy for inhibiting ovarian cancer in a 

patient; 

1 0) monitoring the progression of ovarian cancer in a patient; 

11) selecting a composition or therapy for inhibiting ovarian cancer in a 
5 patient; 

1 2) treating a patient afflicted with ovarian cancer; 

1 3) inhibiting ovarian cancer in a patient; 

14) assessing the ovarian carcinogenic potential of a test compound; 
and 

10 15) inhibiting an ovarian cancer in a patient at risk for developing 

ovarian cancer. 



The invention thus includes a method of assessing whether a patient is afflicted 
with ovarian cancer. This method comprises comparing the level of expression of a 

1 5 marker in a patient sample and the normal level of expression of the marker in a control, 
e.g., a non-ovarian cancer sample. A significant difference between the level of 
expression of the marker in the patient sample and the normal level is an indication that 
the patient is afflicted with ovarian cancer. The marker is selected from the group 
consisting of the markers listed in Tables 1-2. 

20 The polynucleotides set forth in Tables 1-2 represent previously unidentified 

nucleotide sequences. These nucleotide sequences were identified through subtracted 
library experiments described herein. Also provided by this invention are 
polynucleotides that correspond to the polynucleotides of Tables 1-2. In one 
embodiment, these polynucleotides are obtained by identification of a larger fragment or 

25 full-length coding sequence of these polynucleotides. Gene delivery vehicles, host cells, 
compositions and databases (all described herein) containing these polynucleotides are 
also provided by this invention. 

Any marker or combination of markers listed in Tables 1-2, as well as any 
known markers in combination with the markers set forth in Tables 1-2, may be used in 

30 the compositions, kits, and methods of the present invention. In general, it is preferable 
to use markers for which the difference between the level of expression of the marker in 
ovarian cancer cells and the level of expression of the same marker in normal ovarian 
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cells is as great as possible. Although this difference can be as small as the limit of 
detection of the method for assessing expression of the marker, it is preferred that the 
difference be at least greater than the standard error of the assessment method, and 
preferably a difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-, 
5 500-, 1 000-fold or greater. 

It is recognized that certain markers correspond to proteins which are secreted 
from ovarian cells (i.e. one or both of normal and cancerous cells) to the extracellular 
space surrounding the cells. These markers are preferably used in certain embodiments 
of the compositions, kits, and methods of the invention, owing to the fact that the protein 

10 corresponding to each of these markers can be detected in an ovary-associated body 
fluid sample, which may be more easily collected from a human patient than a tissue 
biopsy sample. In addition, preferred in vivo techniques for detection of a protein 
corresponding to a marker of the invention include introducing into a subject a labeled 
antibody directed against the protein. For example, the antibody can be labeled with a 

15 radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. 

It is a simple matter for the skilled artisan to determine whether any particular 
marker corresponds to a secreted protein. In order to make this determination, the 
protein corresponding to a marker is expressed in a test cell (e.g. a cell of an ovarian cell 

20 line), extracellular fluid is collected, and the presence or absence of the protein in the 
extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically 
with the protein). 

The following is an example of a method which can be used to detect secretion 
of a protein corresponding to a marker of the invention. About 8 x 10 5 293T cells are 

25 incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's 
medium {DMEM} supplemented with 10% fetal bovine serum) under a 5% (v/v) C0 2 , 
95% air atmosphere to about 60-70% confluence. The cells are then transfected using a 
standard transfection mixture comprising 2 micrograms of DNA comprising an 
expression vector encoding the protein and 10 microliters of LipofectAMINE™ 

30 (GIBCO/BRL Catalog no. 1 8342-012) per well. The transfection mixture is maintained 
for about 5 hours, and then replaced with fresh growth medium and maintained in an air 
atmosphere. Each well is gently rinsed twice with DMEM which does not contain 
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methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424- 54). About 1 milliliter 

35 

of DMEM-MC and about 50 microcuries of Trans- S™ reagent (ICN Catalog no. 
51006) are added to each well. The wells are maintained under the 5% C0 2 atmosphere 
described above and incubated at 37°C for a selected period. Following incubation, 150 
5 microliters of conditioned medium is removed and centriflxged to remove floating cells 
and debris. The presence of the protein in the supernatant is an indication that the 
protein is secreted. 

Examples of ovary-associated body fluids include blood fluids {e.g. whole blood, 
blood serum, blood having platelets removed therefrom, etc.), lymph, ascitic fluids, 

10 gynecological fluids {e.g. ovarian, fallopian, and uterine secretions, menses, vaginal 
douching fluids, fluids used to rinse cervical cell samples, etc.), cystic fluid, urine, and 
fluids collected by peritoneal rinsing {e.g. fluids applied and collected during 
laparoscopy or fluids instilled into and withdrawn from the peritoneal cavity of a human 
patient). In these embodiments, the level of expression of the marker can be assessed by 

1 5 assessing the amount {e.g. absolute amount or concentration) of the marker in an ovary- : 
associated body fluid obtained from a patient. The fluid can, of course, be subjected to a 
variety of well-known post-collection preparative and storage techniques {e.g. storage, 
freezing, ultrafiltration, concentration, evaporation, centrifugation, etc.) prior to 
assessing the amount of the marker in the fluid. 

20 Many ovary-associated body fluids {i.e. usually excluding urine) can have 

ovarian cells, e.g. ovarian epithelium, therein, particularly when the ovarian cells are 
cancerous, and, more particularly, when the ovarian cancer is metastasizing. Cell- 
containing fluids which can contain ovarian cancer cells include, but are not limited to, 
peritoneal ascites, fluids collected by peritoneal rinsing, fluids collected by uterine 

25 rinsing, uterine fluids such as uterine exudate and menses, pleural fluid, and ovarian 
exudates. Thus, the compositions, kits, and methods of the invention can be used to 
detect expression of markers corresponding to proteins having at least one portion which 
is displayed on the surface of cells which express it. Examples of such proteins are 
indicated in the Tables herein. Although not every protein having at least one cell- 

30 surface portion is indicated in the Tables, it is a simple matter for the skilled artisan to 
determine whether the protein corresponding to any particular marker comprises a cell- 
surface protein. For example, immunological methods may be used to detect such 
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proteins on whole cells, or well known computer-based sequence analysis methods (e.g. 
the SIGNALP program; Nielsen etal, 1997, Protein Engineering 10:1-6) may be used 
to predict the presence of at least one extracellular domain (i.e. including both secreted 
proteins and proteins having at least one cell-surface domain). Expression of a marker 
5 corresponding to a protein having at least one portion which is displayed on the surface 
of a cell which expresses it may be detected without necessarily lysing the cell (e.g. 
using a labeled antibody which binds specifically with a cell-surface domain of the 
protein). 

Expression of a marker of the invention may be assessed by any of a wide 

1 0 variety of well known methods for detecting expression of a transcribed molecule or 
protein. Non-limiting examples of such methods include immunological methods for 
detection of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification 
methods, protein function or activity assays, nucleic acid hybridization methods, nucleic 
acid reverse transcription methods, and nucleic acid amplification methods. 

15 In a preferred embodiment, expression of a marker is assessed using an antibody 

(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled 
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with 
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin} ), or an antibody 
fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.) 

20 which binds specifically with a protein corresponding to the marker, such as the protein 
encoded by the open reading frame corresponding to the marker or such a protein which 
has undergone all or a portion of its normal post-translational modification. 

In another preferred embodiment, expression of a marker is assessed by 
preparing mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient 

25 sample, and by hybridizing the mRNA/cDNA with a reference polynucleotide which is a 
complement of a polynucleotide comprising the marker, and fragments thereof. cDNA 
can, optionally, be amplified using any of a variety of polymerase chain reaction 
methods prior to hybridization with the reference polynucleotide; preferably, it is not 
amplified. Expression of one or more markers can likewise be detected using 

30 quantitative PCR to assess the level of expression of the marker(s). Alternatively, any 
of the many known methods of detecting mutations or variants (e.g. single nucleotide 
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polymorphisms, deletions, etc.) of a marker of the invention may be used to detect 
occurrence of a marker in a patient. 

In a related embodiment, a mixture of transcribed polynucleotides obtained from 
the sample is contacted with a substrate having fixed thereto a polynucleotide 
5 complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25, 
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If 
polynucleotides complementary to or homologous with are differentially detectable on 
the substrate (e.g. detectable using different chromophores or fluorophores, or fixed to 
different selected positions), then the levels of expression of a plurality of markers can 

10 be assessed simultaneously using a single substrate (e.g. a "gene chip" microarray of 
polynucleotides fixed at selected positions). When a method of assessing marker 
expression is used which involves hybridization of one nucleic acid with another, it is 
preferred that the hybridization be performed under stringent hybridization conditions. 

Because the compositions, kits, and methods of the invention rely on detection of 

15 a difference in expression levels of one or more markers of the invention, it is preferable 
that the level of expression of the marker is significantly greater than the minimum 
detection limit of the method used to assess expression in at least one of normal ovarian 
cells and cancerous ovarian cells. 

It is understood that by routine screening of additional patient samples using one 

20 or more of the markers of the invention, it will be realized that certain of the markers are 
over- or under-expressed in cancers of various types, including specific ovarian cancers, 
as well as other cancers such as breast cancer, cervical cancer, etc. For example, it will 
be confirmed that some of the markers of the invention are over- or under-expressed in 
most (ie. 50% or more) or substantially all (i.e. 80% or more) of ovarian cancer. 

25 Furthermore, it will be confirmed that certain of the markers of the invention are 
associated with ovarian cancer of various stages (ie. stage I, II, III, and IV ovarian 
cancers, as well as subclassifications IA, IB, IC, IIA, IIB, IIC, IIIA, IIIB, and IIIC, using 
the FIGO Stage Grouping system for primary carcinoma of the ovary; 1987, Am. J. 
Obstet. Gynecol. 156:236), of various histologic subtypes (e.g. serous, mucinous, 

30 endometroid, and clear cell subtypes, as well as subclassifications and alternate 
classifications adenocarcinoma, papillary adenocarcinoma, papillary 
cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma, 
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cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma, 
endometrioid stromal sarcoma, mesodermal (Miillerian) mixed tumor, mesonephroid 
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and 
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant 
5 ovarian tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and 
various grades (i.e. grade I {well differentiated} , grade II {moderately, well 
differentiated}, and grade III {poorly differentiated from surrounding normal tissue} ). 
In addition, as a greater number of patient samples are assessed for expression of the 
markers of the invention and the outcomes of the individual patients from whom the 

10 samples were obtained are correlated, it will also be confirmed that altered expression of 
certain of the markers of the invention are strongly correlated with malignant cancers 
and that altered expression of other markers of the invention are strongly correlated with 
benign tumors. The compositions, kits, and methods of the invention are thus useful for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 

1 5 nature of ovarian cancer in patients. In addition, these compositions, kits, and methods 
can be used to detect and differentiate epithelial, stromal, and germ cell ovarian cancers. 

When the compositions, kits, and methods of the invention are used for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of ovarian cancer in a patient, it is preferred that the marker or panel of markers 

20 of the invention is selected such that a positive result is obtained in at least about 20%, 
and preferably at least about 40%, 60%, or 80%, and more preferably in substantially all 
patients afflicted with an ovarian cancer of the corresponding stage, grade, histological 
type, or benign/malignant nature. Preferably, the marker or panel of markers of the 
invention is selected such that a PPV of greater than about 10% is obtained for the 

25 general population (more preferably coupled with an assay specificity greater than 
99.5%). 

When a plurality of markers of the invention are used in the compositions, kits, 
and methods of the invention, the level of expression of each marker in a patient sample 
can be compared with the normal level of expression of each of the plurality of markers 
30 in non-cancerous samples of the same type, either in a single reaction mixture (Le. using 
reagents, such as different fluorescent probes, for each marker) or in individual reaction 
mixtures corresponding to one or more of the markers. In one embodiment, a 
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significantly enhanced level of expression of more than one of the plurality of markers 
in the sample, relative to the corresponding normal levels, is an indication that the 
patient is afflicted with ovarian cancer. In another embodiment, a significantly lower 
level of expression in the sample of each of the plurality of markers, relative to the 
5 corresponding normal levels, is an indication that the patient is afflicted with ovarian 
cancer. In yet another embodiment, a significantly enhanced level of expression of one 
or more marks and a significantly lower level of expression of one or more markers in a 
sample relative to the corresponding normal levels, is an indication that the patient is 
afflicted with ovarian cancer. When a plurality of markers is used, it is preferred that 2, 

10 3, 4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer 
markers are preferred. 

In order to maximize the sensitivity of the compositions, kits, and methods of the 
invention (i.e. by interference attributable to cells of non-ovarian origin in a patient 
sample), it is preferable that the marker of the invention used therein be a marker which 

15 has a restricted tissue distribution, e.g., normally not expressed in a non-epithelial tissue, 
and more preferably a marker which is normally not expressed in a non-ovarian tissue. 

Only a small number of markers are known to be associated with ovarian cancers 
(e.g. AKT2, Ki-RAS, ERBB2, c-MYC, RBI, and TP53; Lynch, supra). These markers are 
not, of course, included among the markers of the invention, although they may be used 

20 together with one or more markers of the invention in a panel of markers, for example. 
It is well known that certain types of genes, such as oncogenes, tumor suppressor genes, 
growth factor-like genes, protease-like genes, and protein kinase-like genes are often 
involved with development of cancers of various types. Thus, among the markers of the 
invention, use of those which correspond to proteins which resemble known proteins 

25 encoded by known oncogenes and tumor suppressor genes, and those which correspond 
to proteins which resemble growth factors, proteases, and protein kinases are preferred. 

Known oncogenes and tumor suppressor genes include, for example, abl, abr, 
akt2, ape, bc\2a, bcl2j3, bc!3, bcr, brcal, brca2, cbl, ccndl, cdc42, cdk4, crk- II, 
csflrlfins, dbl, dec, dpc4lsmad4, e-cad, e2fllrbap, egfrlerbb-1, elkl, elk3, eph, erg, etsl, 

30 ets2,fer,fgrlsrc2,flillergb2,fos,Jpslfes,jral,fra hck, hek, her2lerbb- 2lneu y 
her3/erbb-3, her4lerbb-4, hrasl, hst2, hstfl, igfbp2, ink4a, ink4b, int2lfgf3,jun,junb, 
jund, kip2, kit, kras2a, kras2b, Ick, lyn, mas, max, mcc, mdm2 } met, mlhl, mmplO, mos, 
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msh2, msh3, msh6, myb, myba, mybb, myc, mycll, mycn, nfl, nf2, nme2, nras,p53, 
pdgfb,phb, piml , pmsl , pms2, ptc, pten, rafl, rapla, rbl 9 rel, ret, rosl, ski, srcl, tall, 
tgfbr2, tgfb3, tgfbr3, thral, thrb, tiaml, timp3, tjpl, tp53, trk, vav, vhl, vil2, wafl, wntl, 
wnt2, wtl, and yes 1 (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene 
5 Facts Book, 2nd Ed., Academic Press; Fishel et al, 1994, Science 266:1403-1405). 

Known growth factors include platelet-derived growth factor alpha, platelet- 
derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog), 
thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte 
growth and development factor), erythropoietin, B cell growth factor, macrophage 

10 stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor 
(hepapoietin A), insulin-like growth factor 1 (somatomedia C), hepatoma-derived 
growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic 
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone 
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor, 

15 connective tissue activation peptide 3, epidermal growth factor (EGF), teratocarcinoma- 
derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived 
factor 1, vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental 
growth factor (vascular endothelial growth factor-related protein), transforming growth . 
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth 

20 factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth 
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6 
and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast 
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor), 
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1), 

25 brain-derived neurotrophic factor, and recombinant glial growth factor 2. 

Known proteases include interleukin-1 beta convertase and its precursors, Mch6 
and its precursors, Mch2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma 
cysteine protease, Ich-lS, Ich-IL, Ich-2 and its precursors, TY protease, matrix 
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A, 

30 72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin), 
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12 
(macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1, 
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cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8 
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D 
(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue 
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type 
5 plasminogen activator. 

Known protein kinases include DAP kinase, serine/threonine protein kinases 
NIK, PK428, Krs-2, SAK, and EMK, interferon-inducible double stranded RNA 
dependent protein kinase, FAST kinase, AIM1, IPLl-like midbody-associated protein 
kinase-1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdkl-10), 

10 checkpoint kinase Chkl, Nek3 protein kinase, BMK1 beta kinase, Clkl, Clk2, Clk3, 
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein 
kinase 2, pLK, Mytl, c-Jun N-terminal kinase 2, Cam kinase 1, the MAP kinases, 
insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein 
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase 

15 Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type 
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4, Rho- 
associated coiled-coil containing protein kinase pl60ROCK, protein tyrosine kinase t- 
Rorl, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated 
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity mitogen- 

20 activated protein kinases 1 and 2, casein kinase I gamma 2, p21 -activated protein kinase 
Pakl, lipid-activated protein kinase PRK2, focal adhesion kinase, dual-specificity 
tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases 
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase, stress- 
activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase 

25 Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src 
homology. 2 protein), agammaglobulinaemia protein-tyrosine kinase (Atk), protein 
kinase ATR, guanylate kinase 1, thrombopoeitin receptor and its precursors, DAG 
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr 
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg 

30 (Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and v- 
fps (feline sarcoma viral oncogene and Fujinami avian sarcoma viral oncogene 
homologs), proto-oncogene c-cot, oncogene pim-1, and oncogene masL 
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It is recognized that the compositions, kits, and methods of the invention will be 
of particular utility to patients having an enhanced risk of developing ovarian cancer and 
their medical advisors. Patients recognized as having an enhanced risk of developing 
ovarian cancer include, for example, patients having a familial history of ovarian cancer, 
5 patients identified as having a mutant oncogene (i.e. at least one allele), and patients of 
advancing age (i.e. women older than about 50 or 60 years). 

The level of expression of a marker in normal (i.e. non-cancerous) human 
ovarian tissue can be assessed in a variety of ways. In one embodiment, this normal 
level of expression is assessed by assessing the level of expression of the marker in a 

10 portion of ovarian cells which appears to be non-cancerous and by comparing this 

normal level of expression with the level of expression in a portion of the ovarian cells 
which is suspected of being cancerous. For example, when laparoscopy or other 
medical procedure, reveals the presence of a lump on one portion of a patient's ovary, 
but not on another portion of the same ovary or on the other ovary, the normal level of 

1 5 expression of a marker may be assessed using one or both or the non-affected ovary and 
a non-affected portion of the affected ovary, and this normal level of expression may be 
compared with the level of expression of the same marker in an affected portion (i.e. the 
lump) of the affected ovary. Alternately, and particularly as further information 
becomes available as a result of routine performance of the methods described herein, 

20 population-average values for normal expression of the markers of the invention may be 
used. In other embodiments, the 'normal' level of expression of a marker may be 
determined by assessing expression of the marker in a patient sample obtained from a 
non-cancer-afflicted patient, from a patient sample obtained from a patient before the 
suspected onset of ovarian cancer in the patient, from archived patient samples, and the 

25 like. 

The invention includes compositions, kits, and methods for assessing the 
presence of ovarian cancer cells in a sample (e.g. an archived tissue sample or a sample 
obtained from a patient). These compositions, kits, and methods are substantially the 
same as those described above, except that, where necessary, the compositions, kits, and 
30 methods are adapted for use with samples other than patient samples. For example, 
when the sample to be used is a parafinized, archived human tissue sample, it can be 
necessary to adjust the ratio of compounds in the compositions of the invention, in the 
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kits of the invention, or the methods used to assess levels of marker expression in the 
sample. Such methods are well known in the art and within the skill of the ordinary 
artisan. 

The invention includes a kit for assessing the presence of ovarian cancer cells 
5 (e.g. in a sample such as a patient sample). The kit comprises a plurality of reagents, 
each of which is capable of binding specifically with a nucleic acid or polypeptide 
corresponding to a marker of the invention. Suitable reagents for binding with a 
polypeptide corresponding to a marker of the invention include antibodies, antibody 
derivatives, antibody fragments, and the like. Suitable reagents for binding with a 
10 nucleic acid (e.g. a genomic DNA 3 an mRNA, a spliced mRNA, a cDNA, or the like) 
include complementary nucleic acids. For example, the nucleic acid reagents may 
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled 
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon 
probes, and the like. 

1 5 The kit of the invention may optionally comprise additional components useful 

for performing the methods of the invention. By way of example, the kit may comprise . - 
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for 
binding an antibody with a protein with which it specifically binds, one or more sample 
compartments, an instructional material which describes performance of a method of the '« 

20 invention, a sample of normal ovarian cells, a sample of ovarian cancer cells, and the 
like. 

The invention also includes a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether patient is afflicted with an ovarian 
cancer. In this method, a protein corresponding to a marker of the invention is isolated 

25 (e.g. by purification from a cell in which it is expressed or by transcription and 
translation of a nucleic acid encoding the protein in vivo or in vitro using known 
methods). A vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is 
immunized using the isolated protein. The vertebrate may optionally (and preferably) be 
immunized at least one additional time with the isolated protein, so that the vertebrate 

30 exhibits a robust immune response to the protein. Splenocytes are isolated from the 
immunized vertebrate and fused with an immortalized cell line to form hybridomas, 
using any of a variety of methods well known in the art. Hybridomas formed in this 
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manner are then screened using standard methods to identify one or more hybridomas 
which produce an antibody which specifically binds with the protein. The invention 
also includes hybridomas made by this method and antibodies made using such 
hybridomas. 

5 The invention also includes a method of assessing the efficacy of a test 

compound for inhibiting ovarian cancer cells. As described above, differences in the 
level of expression of the markers of the invention correlate with the cancerous state of 
ovarian cells. Although it is recognized that changes in the levels of expression of 
certain of the markers of the invention likely result from the cancerous state of ovarian 

1 0 cells, it is likewise recognized that changes in the levels of expression of other of the 
markers of the invention induce, maintain, and promote the cancerous state of those 
cells. Thus, compounds which inhibit an ovarian cancer in a patient will cause the level 
of expression of one or more of the markers of the invention to change to a level nearer 
the normal level of expression for that marker {i.e. the level of expression for the marker 

1 5 in non-cancerous ovarian cells). 

This method thus comprises comparing expression of a marker in a first ovarian 
cell sample and maintained in the presence of the test compound and expression of the 
marker in a second ovarian cell sample and maintained in the absence of the test 
compound. A significant alteration in the level of expression of a marker listed in 

20 Tables 1-2 is an indication that the test compound inhibits ovarian cancer. The ovarian 
cell samples may, for example, be aliquots of a single sample of normal ovarian cells 
obtained from a patient, pooled samples of normal ovarian cells obtained from a patient, 
cells of a normal ovarian cell line, aliquots of a single sample of ovarian cancer cells 
obtained from a patient, pooled samples of ovarian cancer cells obtained from a patient, 

25 cells of an ovarian cancer cell line, or the like. In one embodiment, the samples are 
ovarian cancer cells obtained from a patient and a plurality of compounds known to be 
effective for inhibiting various ovarian cancers are tested in order to identify the 
compound which is likely to best inhibit the ovarian cancer in the patient. 

This method may likewise be used to assess the efficacy of a therapy for 

30 inhibiting ovarian cancer in a patient. In this method, the level of expression of one or 
more markers of the invention in a pair of samples (one subjected to the therapy, the 
other not subjected to the therapy) is assessed. As with the method of assessing the 
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efficacy of test compounds, if the therapy induces a significant alteration in the level of 
expression of a marker listed in Tables 1-2 then the therapy is efficacious for inhibiting 
ovarian cancer. As above, if samples from a selected patient are used in this method, 
then alternative therapies can be assessed in vitro in order to select a therapy most likely 
5 to be efficacious for inhibiting ovarian cancer in the patient. 

As described herein, ovarian cancer in patients is associated with an altereration 
in the level of expression of one or more markers listed in Tables 1-2. While, as 
discussed above, some of these changes in expression level result from occurrence of the 
ovarian cancer, others of these changes induce, maintain, and promote the cancerous 

10 state of ovarian cancer cells. Thus, ovarian cancer characterized by an increase in the 
level of expression of one or more markers listed in either or both of Tables 1-2 can be 
inhibited by inhibiting expression of those markers. 

Expression of a marker listed in Tables 1-2 can be inhibited in a number of ways 
generally known in the art. For example, an antisense oligonucleotide can be provided 

15 to the ovarian cancer cells in order to inhibit transcription, translation, or both, of the 
marker(s). Alternately, a polynucleotide encoding an antibody, an antibody derivative, 
or an antibody fragment, and operably linked with an appropriate promoter/regulator 
region, can be provided to the cell in order to generate intracellular antibodies which 
will inhibit the function or activity of the protein corresponding to the marker(s). Using 

20 the methods described herein, a variety of molecules, particularly including molecules 
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the marker(s). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the ovarian cancer cells of the patient. 

25 Expression of a marker listed in Tables 1-2 can be enhanced in a number of ways 

generally known in the art. For example, a polynucleotide encoding the marker and 
operably linked with an appropriate promoter/regulator region can be provided to 
ovarian cancer cells of the patient in order to induce enhanced expression of the protein 
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is 

30 capable of crossing the cell membrane, inserting itself in the cell membrane, or is 

normally a secreted protein, then expression of the protein can be enhanced by providing 
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the protein (e.g. directly or by way of the bloodstream or another ovary-associated fluid) 
to ovarian cancer cells in the patient. 

As described above, the cancerous state of human ovarian cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention 
5 includes a method for assessing the human ovarian cell carcinogenic potential of a test 
compound. This method comprises maintaining separate aliquots of human ovarian 
cells in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-2 in the aliquot maintained in the presence of 

10 the test compound (relative to the aliquot maintained in the absence of the test 
compound) is an indication that the test compound possesses human ovarian cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the 

1 5 level of expression is enhanced or inhibited, or by comparing both. 

Various aspects of the invention are described in further detail in the following 
subsections. 

I. Isolated Nucleic Acid Molecules 

20 One aspect of the invention pertains to novel isolated nucleic acid molecules that 

correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 

25 correspond to a marker of the invention, including nucleic acids which encode a 

polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g., those suitable for use as PCR primers for the amplification or 
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules 

30 (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 
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An "isolated" nucleic acid molecule is one which is separated from other nucleic 
acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein- 
encoding sequences) which naturally flank the nucleic acid (i.e., sequences located at the 
5 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 

10 molecule, such as a cDNA molecule, can be substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention can be isolated using standard 
molecular biology techniques and the sequence information in the database records 

1 5 described herein. Using all or a portion of such nucleic acid sequences, nucleic acid 
molecules of the invention can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook et ai, ed., Molecular Cloning: A Laboratory 
Manual 2nd ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

20 A nucleic acid molecule of the invention can be amplified using cDNA, mRNA, 

or genomic DNA as a template and appropriate oligonucleotide primers according to 
standard PCR amplification techniques. The nucleic acid so amplified can be cloned 
into an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 

25 invention can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
30 of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which 
corresponds to a marker of the invention. A nucleic acid molecule which is 
complementary to a given nucleotide sequence is one which is sufficiently 
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complementary to the given nucleotide sequence that it can hybridize to the given 
nucleotide sequence thereby forming a stable duplex. 

Moreover, a nucleic acid molecule of the invention can comprise only a portion 
of a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 
5 marker of the invention or which encodes a polypeptide corresponding to a marker of 
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about 

10 25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive 
nucleotides of a nucleic acid of the invention. 

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a 

15 radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 
express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

20 The invention further encompasses nucleic acid molecules that differ, due to 

degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding 
a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 

It will be appreciated by those skilled in the art that DNA sequence 
25 polymorphisms that lead to changes in the amino acid sequence can exist within a 

population (e.g., the human population). Such genetic polymorphisms can exist among 
individuals within a population due to natural allelic variation. An allele is one of a 
group of genes which occur alternatively at a given genetic locus. In addition, it will be 
appreciated that DNA polymorphisms that affect RNA expression levels can also exist 
30 that may affect the overall expression level of that gene (e.g., by affecting regulation or 
degradation). 
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As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
5 marker of the invention. Such natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
variety of individuals. Any and all such nucleotide variations and resulting amino acid 

10 polymorphisms or variations that are the result of natural allelic variation and that do not 
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 

15 4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a 
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 
under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%) 

20 identical to each other typically remain hybridized to each other. Such stringent 

conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 
of Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989). A 
preferred, non-limiting example of stringent hybridization conditions for annealing two 
single-stranded DNA each of which is at least about 100 bases in length and/or for 

25 annealing a single-stranded DNA and a single-stranded RNA each of which is at least 
about 100 bases in length, are hybridization in 6X sodium chloride/sodium citrate (SSC) 
at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C. 
Further preferred hybridization conditions are taught in Lockhart, et al, Nature 
Biotechnology, Volume 14, 1996 August: 1675- 1680; Breslauer, et al, Proc. Natl. Acad. 

30 Sci. USA, Volume 83, 1986 June: 3746-3750; Van Ness, et al, Nucleic Acids Research, 
Volume 19, No. 19, 1991 September: 5143-5151; McGraw, et al, BioTechniques, 
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Volume 8, No. 6 1990: 674-678; and Milner, et aL, Nature Biotechnology, Volume 15, 
1997 June: 537-541, all expressly incorporated by reference. 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of 
the invention that can exist in the population, the skilled artisan will further appreciate 
5 that sequence changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of 
the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A "non- 
essential" amino acid residue is a residue that can be altered from the wild-type 

10 sequence without altering the biological activity, whereas an "essential" amino acid 

residue is required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologs of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, amino 
acid residues that are conserved among the homologs of various species (e.g., murine 

1 5 and human) may be essential for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from the 
20 naturally-occurring proteins which correspond to the markers of the invention, yet retain 
biological activity. In one embodiment, such a protein has an amino acid sequence that 
is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of one of the proteins which correspond to the markers of the 
invention. 

25 An isolated nucleic acid molecule encoding a variant protein can be created by 

introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such as site-directed mutagenesis 

30 and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
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amino acid residue having a similar side chain. Families of amino acid residues having 
similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
5 serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis, 

10 and the resultant mutants can be screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid of the invention, e.g., 

15 complementary to the coding strand of a double-stranded cDNA molecule 

corresponding to a marker of the invention or complementary to an mRNA sequence 
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of 
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 
invention. The antisense nucleic acid can be complementary to an entire coding strand, 

20 or to only a portion thereof, e.g. , all or part of the protein coding region (or open reading 
frame). An antisense nucleic acid molecule can also be antisense to all or part of a non- 
coding region of the coding strand of a nucleotide sequence encoding a polypeptide of 
the invention. The non-coding regions ("5* and 3' untranslated regions") are the 5' and 3' 
sequences which flank the coding region and are not translated into amino acids. 

25 An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 

40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 

30 variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
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substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thioiiridine, 5- 
5 carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, 5 ! -methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 

10 N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3 -(3 -amino-3 -N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
Alternatively, the antisense nucleic acid can be produced biologically using an 

1 5 expression vector into which a nucleic acid has been sub-cloned in an antisense 

orientation RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
20 to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the 
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription 
and/or translation. The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense 
25 nucleic acid molecule which binds to DNA duplexes, through specific interactions in the 
major groove of the double helix. Examples of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site or 
infusion of the antisense nucleic acid into an ovary-associated body fluid. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then 
30 administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules 
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to peptides or antibodies which bind to cell surface receptors or antigens. The antisense 
nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
constructs in which the antisense nucleic acid molecule is placed under the control of a 
5 strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual a-units, the 
strands run parallel to each other (Gaultier et al, 1987, Nucleic Acids Res. 15:6625- 

10 6641). The antisense nucleic acid molecule can also comprise a 2-o- 

methylribonucleotide (Inoue et al, 1987, Nucleic Acids. Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al, 1987, FEES Lett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic KNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded 

1 5 nucleic acid, such as an mRNA, to which they have a complementary region. Thus, 
ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach, 1988, 
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 
for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the 

20 invention can be designed based upon the nucleotide sequence of a cDNA 

corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved (see Cech et al U.S. Patent No. 
4,987,071; and Cech et al U.S. Patent No. 5,1 16,742). Alternatively, an mRNA 

25 encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and 
Szostak, 1993, Science 261:141 1-1418). 

The invention also encompasses nucleic acid molecules which form triple helical 
structures. For example, expression of a polypeptide of the invention can be inhibited 

30 by targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide (e.g., the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 
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(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. MY. Acad Sci. 660:27- 
36; and Maher (1992) Bioassays 14(12):807-15. 

In various embodiments, the nucleic acid molecules of the invention can be 
modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
5 stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al, 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used 
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 

1 0 pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup 
etal. (1996), supra; Peny-O'Keefe et al (1996) Proc. Natl Acad Sci. USA 93:14670- 

15 675. 

PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs can also be used, e.g., in the analysis of single base pair mutations in 

20 a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used 
in combination with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as 
probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry- 
O'Keefe etal. t 1996, Proc. Natl. Acad. Sci. USA 93:14670-675). 

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 

25 cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation 
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which can 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g. , RNASE H and DNA polymerases, to interact with the DNA 

30 portion while the PNA portion would provide high binding affinity and specificity. 

PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 
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1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hyrup (1996), supra, and Finn et al (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds 
5 such as 5 , -(4-methoxytrityl)amino-5 r -deoxy-thymidine phosphoramidite can be used as a 
link between the PNA and the 5' end of DNA (Mag et al, 1989, Nucleic Acids Res. 
17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a 
chimeric molecule with a 5 ! PNA segment and a 3' DNA segment (Finn et al, 1996, 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 

10 synthesized with a 5 r DNA segment and a 3' PNA segment (Peterser et al, 1975, 
Bioorganic Med. Chem. Lett. 5: 1 1 19-1 1 124). 

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. 

15 Set USA 86:6553-6556; Lemaitre et al, 1987, Proc. Natl Acad Set USA 84:648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol etal, 1988, Bio/Techniques 
6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 

20 this end, the oligonucleotide can be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at least one 
region which is complementary to a nucleic acid of the invention, such that the 

25 molecular beacon is useful for quantitating the presence of the nucleic acid of the 

invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a 
pair of complementary regions and having a fluorophore and a fluorescent quencher 
associated therewith. The fluorophore and quencher are associated with different 
portions of the nucleic acid in such an orientation that when the complementary regions 

30 are annealed with one another, fluorescence of the fluorophore is quenched by the 
quencher. When the complementary regions of the nucleic acid are not annealed with 
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one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular 
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

II Isolated Proteins and Antibodies 
5 One aspect of the invention pertains to isolated proteins which correspond to 

individual markers of the invention, and biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to raise antibodies directed 
against a polypeptide corresponding to a marker of the invention. In one embodiment, 
the native polypeptide corresponding to a marker can be isolated from cells or tissue 

1 0 sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, polypeptides corresponding to a marker of the 
invention are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a polypeptide corresponding to a marker of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 

15 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical, 
precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of protein in which the 

20 protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. Thus, protein that is substantially free of cellular material 
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry 
weight) of heterologous protein (also referred to herein as a "contaminating protein"). 
When the protein or biologically active portion thereof is recombinantly produced, it is 

25 also preferably substantially free of culture medium, Le. 9 culture medium represents less 
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably substantially free of chemical 
precursors or other chemicals, Le. 9 it is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the protein. Accordingly such 

30 preparations of the protein have less than about 30%, 20%, 1 0%, 5% (by dry weight) of 
chemical precursors or compounds other than the polypeptide of interest. 
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Biologically active portions of a polypeptide corresponding to a marker of the 
invention include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of the protein corresponding to the marker, 
which include fewer amino acids than the full length protein, and exhibit at least one 
5 activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 
biologically active portions, in which other regions of the protein are deleted, can be 

10 prepared by recombinant techniques and evaluated for one or more of the functional 
activities of the native form of a polypeptide of the invention. 

Preferred polypeptides are encoded by the nucleotide sequences of Tables 1-2. 
Other useful proteins are substantially identical (e.g., at least about 40%, preferably 
50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these sequences and retain the 

15 functional activity of the protein of the corresponding naturally-occurring protein yet 
differ in amino acid sequence due to natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 

20 alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences 

25 is a function of the number of identical positions shared by the sequences (i.e., % 

identity = # of identical positions/total # of positions (e.g., overlapping positions) xlOO). 
In one embodiment the two sequences are the same length. 

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 

30 mathematical algorithm utilized for the comparison of two sequences is the algorithm of 
Karlin and Altschul (1990) Proc. Natl Acad. Set USA 87:2264-2268, modified as in 
KarlinandAltschul (1 993) Proc. Natl Acad. Sci. USA 90:5873-5877. Such an 
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algorithm is incorporated into the NBLAST and XBLAST programs of Altschul, et ah 
(1990) J. Mol Biol. 215:403-410. BLAST nucleotide searches can be performed with 
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to a nucleic acid molecules of the invention. BLAST protein searches can 
5 be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et ah (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be 
used to perform an iterated search which detects distant relationships between 

1 0 molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
default parameters of the respective programs {e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, (1988) CABIOS 4:1 1-17. Such an algorithm is incorporated into the 

15 ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Yet another useful algorithm for identifying regions of local sequence similarity 
and alignment is the FASTA algorithm as described in Pearson and Lipman (1988) 

20 Proa Natl Acad. Sci. USA 85:2444-2448. When using the FASTA algorithm for 

comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for 
example, be used with a &-tuple value of 2. 

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 

25 identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein" or "fusion protein" 
comprises all or part (preferably a biologically active part) of a polypeptide 
corresponding to a marker of the invention operably linked to a heterologous 

30 polypeptide (i.e., a polypeptide other than the polypeptide corresponding to the marker). 
Within the fusion protein, the term "operably linked" is intended to indicate that the 
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each 
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other. The heterologous polypeptide can be fused to the ammo-terminus or the 
carboxyl-terminus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in which a polypeptide 
corresponding to a marker of the invention is fused to the carboxyl terminus of GST 
5 sequences. Such fusion proteins can facilitate the purification of a recombinant 
polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 
sequence at its amino terminus. For example, the native signal sequence of a 
polypeptide corresponding to a marker of the invention can be removed and replaced 

10 with a signal sequence from another protein. For example, the gp67 secretory sequence 
of the baculovirus envelope protein can be used as a heterologous signal sequence 
(Ausubel et al, ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY, 
1992). Other examples of eukaryotic heterologous signal sequences include the 
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; 

15 La Jolla, California). In yet another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et al, supra) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention 

20 is fused to sequences derived from a member of the immunoglobulin protein family. 
The immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a ligand (soluble or membrane-bound) and a protein on the surface of a cell 
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion 

25 protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically, 
both for treating proliferative and differentiative disorders and for modulating (e.g. 
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of 
the invention can be used as immunogens to produce antibodies directed against a 

30 polypeptide of the invention in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of receptors with ligands. 
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Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PCR amplification of gene fragments can be carried out using anchor 
5 primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and re-amplified to generate a chimeric 
gene sequence (see, e.g., Ausubel et al., supra). Moreover, many expression vectors are 
commercially available that already encode a fusion moiety (e.g., a GST polypeptide). 
A nucleic acid encoding a polypeptide of the invention can be cloned into such an 
10 expression vector such that the fusion moiety is linked in-frame to the polypeptide of the 
invention. 

A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein 

15 during secretion in one or more cleavage events. Such signal peptides contain 

processing sites that allow cleavage of the signal sequence from the mature proteins as 
they pass through the secretory pathway. Thus, the invention pertains to the described > 
polypeptides having a signal sequence, as well as to polypeptides from which the signal 
sequence has been proteolytically cleaved (i.e. , the cleavage products). In one 

20 embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked 
in an expression vector to a protein of interest, such as a protein which is ordinarily not 
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the expression vector is transformed, 
and the signal sequence is subsequently or concurrently cleaved. The protein can then 

25 be readily purified from the extracellular medium by art recognized methods. 
Alternatively, the signal sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a GST domain. 

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid 

30 sequence which can function as either agonists (mimetics) or as antagonists. Variants 
can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist 
can retain, substantially the same, or a subset, of the biological activities of the naturally 



WO 01/070979 



PCT/US01/09126 



-44- 

occurring form of the protein. An antagonist of a protein can inhibit one or more of the 
activities of the naturally occurring form of the protein by, for example, competitively 
binding to a downstream or upstream member of a cellular signaling cascade which 
includes the protein of interest. Thus, specific biological effects can be elicited by 
5 treatment with a variant of limited function. Treatment of a subject with a variant 
having a subset of the biological activities of the naturally occurring form of the protein 
can have fewer side effects in a subject relative to treatment with the naturally occurring 
form of the protein. 

Variants of a protein of the invention which function as either agonists 

10 (mimetics) or as antagonists can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of the protein of the invention for agonist or antagonist 
activity. In one embodiment, a variegated library of variants is generated by 
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 
library. A variegated library of variants can be produced by, for example, en2ymatically 

15 ligating a mixture of synthetic oligonucleotides into gene sequences such that a 

degenerate set of potential protein sequences is expressible as individual polypeptides, 
or alternatively, as a set of larger fusion proteins (e.g. , for phage display). There are a 
variety of methods which can be used to produce libraries of potential variants of the 
polypeptides of the invention from a degenerate oligonucleotide sequence. Methods for 

20 synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983, 
Tetrahedron 39:3; Itakurae/a/., 1984, Annu. Rev. Biochem. 53:323; Itakuraefa/., 1984, 
Science 198:1056; Ike etal, 1983 Nucleic Acid Res. 11:477). 

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated 

25 population of polypeptides for screening and subsequent selection of variants. For 
example, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the double 
stranded DNA, renaturing the DNA to form double stranded DNA which can include 

30 sense/antisense pairs from different nicked products, removing single stranded portions 
from reformed duplexes by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this method, an expression library can be 
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derived which encodes amino terminal and internal fragments of various sizes of the 
protein of interest. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
5 libraries for gene products having a selected property. The most widely used 

techniques, which are amenable to high through-put analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates 

10 isolation of the vector encoding the gene whose product was detected. Recursive 

ensemble mutagenesis (REM), a technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with the screening assays to identify 
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl Acad. Sci. 
USA 59:781 1-7815; Delgrave et al, 1993, Protein Engineering 6(3):327- 331). 

15 An isolated polypeptide corresponding to a marker of the invention, or a 

fragment thereof, can be used as an immunogen to generate antibodies using standard 
techniques for polyclonal and monoclonal antibody preparation. The full-length 
polypeptide or protein can be used or, alternatively, the invention provides antigenic 
peptide fragments for use as immunogens. The antigenic peptide of a protein of the 

20 invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues 
of the amino acid sequence of one of the polypeptides of the invention, and encompasses 
an epitope of the protein such that an antibody raised against the peptide forms a specific 
immune complex with a marker of the invention to which the protein corresponds. 
Preferred epitopes encompassed by the antigenic peptide are regions that are located on 

25 the surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis, 
hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic 
regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or 
30 vertebrate. An appropriate immunogenic preparation can contain, for example, 

recombinantly-expressed or chemically-synthesized polypeptide. The preparation can 
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fiirther include an adjuvant, such as Freund's complete or incomplete adjuvant, or a 
similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance" 
5 as used interchangeably herein refer to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, i.e., molecules that contain an antigen 
binding site which specifically binds an antigen, such as a polypeptide of the invention, 
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds 
to a given polypeptide of the invention is a molecule which binds the polypeptide, but 

1 0 does not substantially bind other molecules in a sample, e.g., a biological sample, which 
naturally contains the polypeptide. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab')2 fragments which can be generated 
by treating the antibody with an enzyme such as pepsin. The invention provides 
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or 

1 5 "monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 

20 polyclonal antibody compositions are ones that have been selected for antibodies 

directed against a polypeptide or polypeptides of the invention. Particularly preferred 
polyclonal antibody preparations are ones that contain only antibodies directed against a 
polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 

25 immunogen compositions made using a non-human host cell for recombinant expression 
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will 
be present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by 

30 standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
using immobilized polypeptide. If desired, the antibody molecules can be harvested or 
isolated from the subject (e.g., from the blood or serum of the subject) and further 
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purified by well-known techniques, such as protein A chromatography to obtain the IgG 
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention 
can be selected or (e.g., partially purified) or purified by, e.g., affinity chromatography. 
For example, a recombinantly expressed and purified (or partially purified) protein of 
5 the invention is produced as described herein, and covalently or non-covalently coupled 
to a solid support such as, for example, a chromatography column. The column can then 
be used to affinity purify antibodies specific for the proteins of the invention from a 
sample containing antibodies directed against a large number of different epitopes, 
thereby generating a substantially purified antibody composition, i.e., one that is 

10 substantially free of contaminating antibodies. By a substantially purified antibody 
composition is meant, in this context, that the antibody sample contains at most only 
30% (by dry weight) of contaminating antibodies directed against epitopes other than 
those of the desired protein or polypeptide of the invention, and preferably at most 20%, 
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the 

1 5 sample is contaminating antibodies. A purified antibody composition means that at least 
99% of the antibodies in the composition are directed against the desired protein or 
polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to 

20 prepare monoclonal antibodies by standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B 
cell hybridoma technique (see Kozbor et al, 1983, Immunol Today 4:72), the EBV- 
hybridoma technique (see Cole et al, pp. 77-96 In Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing 

25 hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 
al ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a 
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard 
ELISA assay. 

30 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal antibody directed against a polypeptide of the invention can be identified 
and isolated by screening a recombinant combinatorial immunoglobulin library (e.g., an 
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antibody phage display library) with the polypeptide of interest. Kits for generating and 
screening phage display libraries are commercially available (e.g., the Pharmacia 
Recombinant Phage Antibody System, Catalog No. 27-9400-01; and the Stratagene 
SurfZAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods 
5 and reagents particularly amenable for use in generating and screening antibody display 
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. 
WO 92/18619; PCT Publication No. WO 91/17271; PCT Publication No. WO 
92/20791 ; PCT Publication No. WO 92/1 5679; PCT Publication No. WO 93/01288; 
PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 

10 Publication No. WO 90/02809; Fuchs et al. (1991) Bio/Technology 9:1370-1372; Hay et 
al (1992) Hum. Antibod. Hybridomas 3:81-85; Huse et al (1989) Science 246:1275- 
1281; Griffiths <?f al (1993) EMBO 1 12:725-734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 

1 5 made using standard recombinant DNA techniques, are within the scope of the 

invention. A chimeric antibody is a molecule in which different portions are derived 
from different animal species, such as those having a variable region derived from a 
murine mAb and a human immunoglobulin constant region. (See, e.g., Cabilly et al., 
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are 

20 incorporated herein by reference in their entirety.) Humanized antibodies are antibody 
molecules from non-human species having one or more complementarity determining 
regions (CDRs) from the non-human species and a framework region from a human 
immunoglobulin molecule. (See, e.g., Queen, U.S. Patent No. 5,585,089, which is 
incorporated herein by reference in its entirety.) Such chimeric and humanized 

25 monoclonal antibodies can be produced by recombinant DNA techniques known in the 
art, for example using methods described in PCT Publication No. WO 87/02671; 
European Patent Application 184,187; European Patent Application 171,496; European 
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et al (1988) Science 240:1041- 

30 1043; Liu et al (1987) Proc. Natl Acad. Sci. USA 84:3439-3443; Liu et al (1987) J. 
Immunol 139:3521- 3526; Swietal (mi) Proc. Natl Acad Sci USA 84:214-218; 
Nishimuraefa/. (1987) CancerRes. 47:999-1005; Wooded al. (1985) Nature 314:446- 
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449; and Shaw et al (1988)./ Natl Cancer Inst. 80:1553-1559); Morrison (1985) 
Science 229:1202-1207; Oi et al. (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539; 
Jones etal (1986) Nature 321:552-525; Verhoeyanef al (1988) Science 239:1534; and 
Beidlerefa/. (1988) J Immunol 141:4053-4060. 
5 Antibodies of the invention may be used as therapeutic agents in treating 

cancers. In a preferred embodiment, completely human antibodies of the invention are 
used for therapeutic treatment of human cancer patients, particularly those having an 
ovarian cancer. Such antibodies can be produced, for example, using transgenic mice 
which are incapable of expressing endogenous immunoglobulin heavy and light chains 

10 genes, but which can express human heavy and light chain genes. The transgenic mice 
are immunized in the normal fashion with a selected antigen, e.g. , all or a portion of a 
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed 
against the antigen can be obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B 

15 cell differentiation, and subsequently undergo class switching and somatic mutation. 
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA 
and IgE antibodies. For an overview of this technology for producing human antibodies, 
see Lonberg and Huszar (1995) Int. Rev. Immunol 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and human monoclonal antibodies 

20 and protocols for producing such antibodies, see, e.g., U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 
5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA), can be 
engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

25 Completely human antibodies which recognize a selected epitope can be 

generated using a technique referred to as "guided selection." In this approach a 
selected non-human monoclonal antibody, e.g., a murine antibody, is used to guide the 
selection of a completely human antibody recognizing the same epitope (Jespers et al, 
1994, Bio/technology 12:899-903). 

30 An antibody directed against a polypeptide corresponding to a marker of this 

invention {e.g., a monoclonal antibody) can be used to isolate the polypeptide by 
standard techniques, such as affinity chromatography or immunoprecipitation. 
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Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or 
cell supernatant) in order to evaluate the level and pattern of expression of the marker. 
The antibodies can also be used diagnostically to monitor protein levels in tissues or 
body fluids (e.g. in an ovary-associated body fluid) as part of a clinical testing 
5 procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection can be facilitated by coupling the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, 
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 

10 phosphatase, P-galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 

15 materials include luciferase, luciferin, and aequorin, and examples of suitable 
radioactive material include * 25 I, 13I I, 35 S or 3 H. 

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 
moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin 
or cytotoxic agent includes any agent that is detrimental to cells. Examples include 

20 taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy 
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs 
or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites 

25 (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil 
decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, 
melphalan, carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, busulfan, 
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) 
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 

30 doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, 
mithramycin, and anfhramycin (AMC)), and anti-mitotic agents (e.g., vincristine and 
vinblastine). 
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The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 
5 such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as 
tumor necrosis factor, .alpha. -interferon, ,beta.-interferon, nerve growth factor, platelet 
derived growth factor, tissue plasminogen activator; or, biological response modifiers 
such as, for example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), 
interleukin-6 ("IL-6"), granulocyte macrophase colony stimulating factor ("GM-CSF"), 

10 granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 

Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see. e.g., Arnon et aL, "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. 
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug 

1 5 Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies '84: Biological And Clinical 
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 

20 Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

25 Accordingly, in one aspect, the invention provides substantially purified 

antibodies or fragments thereof, and non-human antibodies or fragments thereof, which 
antibodies or fragments specifically bind to a polypeptide comprising an amino acid 
sequence selected from the group consisting of the amino acid sequences of the present 
invention, an amino acid sequence encoded by the cDNA of the present invention, a 

30 fragment of at least 1 5 amino acid residues of an amino acid sequence of the present 
invention, an amino acid sequence which is at least 95% identical to the amino acid 
sequence of the present invention (wherein the percent identity is determined using the 
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ALIGN program of the GCG software package with a PAM120 weight residue table, a 
gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence which is 
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of the nucleic acid molecules of the present invention, or a complement 
thereof, under conditions of hybridization of 6X SSC at 45°C and washing in 0.2 X 
SSC, 0.1% SDS at 65°C. In various embodiments, the substantially purified antibodies 
of the invention, or fragments thereof, can be human, non-human, chimeric and/or 
humanized antibodies. 

In another aspect, the invention provides non-human antibodies or fragments 
thereof, which antibodies or fragments specifically bind to a polypeptide comprising an 
amino acid sequence selected from the group consisting of: the amino acid sequence of 
the present invention, an amino acid sequence encoded by the cDNA of the present 
invention, a fragment of at least 15 amino acid residues of the amino acid sequence of 
the present invention, an amino acid sequence which is at least 95% identical to the 
amino acid sequence of the present invention (wherein the percent identity is determined 
using the ALIGN program of the GCG software package with a PAM120 weight residue ■« 
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence r 
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule consisting of the nucleic acid molecules of the present invention, or a 
complement thereof, under conditions of hybridization of 6X SSC at 45°C and washing 
in 0.2 X SSC, 0.1% SDS at 65°C. Such non-human antibodies can be goat, mouse, 
sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies 
of the invention can be chimeric and/or humanized antibodies. In addition, the non- 
human antibodies of the invention can be polyclonal antibodies or monoclonal 
antibodies. 

In still a further aspect, the invention provides monoclonal antibodies or 
fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of the amino acid 
sequences of the present invention, an amino acid sequence encoded by the cDNA of the 
present invention, a fragment of at least 15 amino acid residues of an amino acid 
sequence of the present invention, an amino acid sequence which is at least 95% 
identical to an amino acid sequence of the present invention (wherein the percent 
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identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule consisting of the nucleic acid molecules of the present invention, 
5 or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and 
washing in 0.2 X SSC, 0.1% SDS at 65°C. The monoclonal antibodies can be human, 
humanized, chimeric and/or non-human antibodies. 

The substantially purified antibodies or fragments thereof may specifically bind 
to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 

1 0 cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extracellular domain of the amino acid sequences of the present invention. 

1 5 Any of the antibodies of the invention can be conjugated to a therapeutic moiety 

or to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

20 The invention also provides a kit cQntaining an antibody of the invention 

conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and 
a pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 
pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes a polypeptide of the present invention, the method comprising 
immunizing a mammal with a polypeptide. The polypeptide used as an immungen 
comprises an amino acid sequence selected from the group consisting of the amino acid 
sequence of the present invention, an amino acid sequence encoded by the cDNA of the 
nucleic acid molecules of the present invention, a fragment of at least 15 amino acid 
residues of the amino acid sequence of the present invention, an amino acid sequence 
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which is at least 95% identical to the amino acid sequence of the present invention 
(wherein the percent identity is determined using the ALIGN program of the GCG 
software package with a P AMI 20 weight residue table, a gap length penalty of 12, and a 
gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid 
5 molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 

After immunization, a sample is collected from the mammal that contains an 
antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is 
1 0 recombinantly produced using a non-human host cell. Optionally, the antibodies can be 
further purified from the sample using techniques well known to those of skill in the art. 
The method can further comprise producing a monoclonal antibody- producing cell from 
the cells of the mammal. Optionally, antibodies are collected from the 
antibody-producing cell. 

15 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding a polypeptide corresponding to a marker of 
the invention (or a portion of such a polypeptide). As used herein, the term "vector" 

20 refers to a nucleic acid molecule capable of transporting another nucleic acid to which it 
has been linked. One type of vector is a "plasmid", which refers to a circular double 
stranded DNA loop into which additional DNA segments can be ligated. Another type 
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which 

25 they are introduced (e.g., bacterial vectors having a bacterial origin of replication and 
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, namely 
expression vectors, are capable of directing the expression of genes to which they are 

30 operably linked. In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids (vectors). However, the invention is 
intended to include such other forms of expression vectors, such as viral vectors (e.g., 
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replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell. This 
5 means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
operably linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 

10 of the nucleotide sequence (e.g. , in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol.185, 

15 Academic Press, San Diego, CA (1991). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 

20 host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for 
25 expression of a polypeptide corresponding to a marker of the invention in prokaryotic 
(e.g., E. coli) or eukaryotic cells (e.g., insect cells {using baculovirus expression 
vectors}, yeast cells or mammalian cells). Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed 
and translated in vitro, for example using 17 promoter regulatory sequences and T7 
30 polymerase. 
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Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
5 vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 

1 0 from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and 
pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 

1 5 maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc(Amann etal, 1988, Gene 69:301-315) and pET lld(Studier etal., p. 60-89, In 
Gene Expression Technology: Methods in Enzymology vol.185, Academic Press, San 
Diego, CA, 1991). Target gene expression from the pTrc vector relies on host RNA 

20 polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression 
from the pET 1 Id vector relies on transcription from a 17 gnlO-lac fusion promoter 
mediated by a co-expressed viral RNA polymerase (T7 gnl). This viral polymerase is 
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage 
harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

25 One strategy to maximize recombinant protein expression in E. coli is to express 

the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, p. 1 19-128, In Gene Expression Technology: Methods 
in Enzymology vol. 185, Academic Press, San Diego, CA, 1990. Another strategy is to 
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 

30 vector so that the individual codons for each amino acid are those preferentially utilized 
in£ co//(Wadaera/., 1992, Nucleic Acids Res. 20:2111-2118). Such alteration of 
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nucleic acid sequences of the invention can be carried out by standard DNA synthesis 
techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari et 
5 al, 1987, EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933- 
943), pJRY88 (Schultz et al, 1987, Gene 54:113-123), pYES2 (Invitrogen Corporation, 
San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells {e.g., Sf 
10 9 cells) include the pAc series (Smith et aL, 1983, Mol. Cell Biol 3:2156-2165) and the 
pVL series (Lucklow and Summers, 1989, Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC 
15 (Kaufman era/., \9%l,EMBOJ. 6:187-195). When used in mammalian cells, the 
expression vector's control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 16 and 1 7 of Sambrook et al, supra. 

In another embodiment, the recombinant mammalian expression vector is 
capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al, 
1987, Genes Dev. 1 :268-277), lymphoid-specific promoters (Calame and Eaton, 1988, 
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore, 1989, EMBO J. 8:729-733) and immunoglobulins (Banerji etal, 1983, Cell 
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989, Proc. Natl Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al, 1985, Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,3 1 6 and European Application Publication No. 264,166). Developmentally- 
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regulated promoters are also encompassed, for example the murine hox promoters 
(Kessel and Gruss, 1990, Science 249:374-379) and the a-fetoprotein promoter (Camper 
and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
5 DNA molecule of the invention cloned into the expression vector in an antisense 

orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a 
manner which allows for expression (by transcription of the DNA molecule) of an RNA 
molecule which is antisense to the mRNA encoding a polypeptide of the invention. 
Regulatory sequences operably linked to a nucleic acid cloned in the antisense 

1 0 orientation can be chosen which direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or 
regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type 
specific expression of antisense RNA. The antisense expression vector can be in the 
form of a recombinant plasmid, phagemid. or attenuated virus in which antisense nucleic 

1 5 acids are produced under the control of a high efficiency regulatory region, the activity 
of which can be determined by the cell type into which the vector is introduced. For a 
discussion of the regulation of gene expression using antisense genes see Weintraub et 
al, 1986, Trends in Genetics, Vol. 1(1). 

Another aspect of the invention pertains to host cells into which a recombinant 

20 expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be 

25 identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic {e.g., E. coli) or eukaryotic cell {e.g., insect 
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
30 conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
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phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al. {supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
5 expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

1 0 hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide corresponding to a marker of the 

1 5 invention. Accordingly, the invention further provides methods for producing a 
polypeptide corresponding to a marker of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding a polypeptide of the 
invention has been introduced) in a suitable medium such that the marker is produced. 

20 In another embodiment, the method further comprises isolating the marker polypeptide 
from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences encoding a polypeptide 

25 corresponding to a marker of the invention have been introduced. Such host cells can 
then be used to create non-human transgenic animals in which exogenous sequences 
encoding a marker protein of the invention have been introduced into their genome or 
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide 
corresponding to a marker of the invention sequences have been altered. Such animals 

30 are useful for studying the function and/or activity of the polypeptide corresponding to 
the marker and for identifying and/or evaluating modulators of polypeptide activity. As 
used herein, a "transgenic animal" is. a non-human animal, preferably a mammal, more 
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preferably a rodent such as a rat or ifcouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 
5 ■ develops and which remains in the genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animal" is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has been altered by homologous recombination between the endogenous gene and 

10 an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a nucleic acid 
encoding a polypeptide corresponding to a marker of the invention into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing 

15 the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to 
particular cells. Methods for generating transgenic animals via embryo manipulation 

20 and microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Patent No. 4,873,191 and in Hogan, Manipulating the Mouse Embryo, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified 

25 based upon the presence of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

30 To create an homologous recombinant animal, a vector is prepared which 

contains at least a portion of a gene encoding a polypeptide corresponding to a marker of 
the invention into which a deletion, addition or substitution has been introduced to 
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thereby alter, e.g., functionally disrupt, the gene* In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
"knock out" vector). Alternatively, the vector can be designed such that, upon 
5 homologous recombination, the endogenous gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous protein). In the homologous 
recombination vector, the altered portion of the gene is flanked at its 5' and 3' ends by 
additional nucleic acid of the gene to allow for homologous recombination to occur 

10 between the exogenous gene carried by the vector and an endogenous gene in an 

embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5 1 and 3 f ends) are included in the vector 
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous 

15 recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced gene has homologously 
recombined with the endogenous gene are selected (see, e.g., Li et al, 1992, Cell 
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and 

20 Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp. 
1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 

25 transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described further in Bradley (1991) 
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication NOS. WO 
90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 

30 contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PL For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al. (1992) Proc. 
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Natl Acad. Sci. USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O f Gorman et al, 1991, Science 
251:1351-1355). If a cre/loxP recombinase system is used to regulate expression of the 
transgene, animals containing transgenes encoding both the Cre recombinase and a 
5 selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be 
10 produced according to the methods described in Wilmut et al. (1997) Nature 385:810- 
813 and PCT Publication NOS. WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein 

15 as "active compounds") corresponding to a marker of the invention can be incorporated 
into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, 

20 coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 
and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary active compounds can also be 

25 incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid corresponding to a 
marker of the invention. Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 

30 or nucleic acid corresponding to a marker of the invention. Such compositions can 
further include additional active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a pharmaceutically 



WO 01/070979 



PCT/US01/O9126 



-63- 

acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention and one or more additional 
active compounds. 

The invention also provides methods (also referred to herein as 
5 "screening assays") for identifying modulators, i.e., candidate or test compounds or 
agents (e.g., peptides, peptidomimetics, peptoids, small molecules or other drugs) which 
(a) bind to the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect 
on the activity of the marker or, more specifically, (c) have a modulatory effect on the 
interactions of the marker with one or more of its natural substrates (e.g., peptide, 
10 protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 

expression of the marker. Such assays typically comprise a reaction between the marker 
and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

15 The test compounds of the present invention may be obtained from any available 

source, including systematic libraries of natural and/or synthetic compounds. Test 
compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, non- 
20 peptide backbone which are resistant to enzymatic degradation but which nevertheless 
remain bioactive; see, e.g., Zuckermann et al, 1994, J. Med Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound 1 library method; and 
synthetic library methods using affinity chromatography selection. The biological 
25 library and peptoid library approaches are limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12:145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl Acad. Sci. U.S.A. 90:6909; Erb et 
30 al. (1994) Proc. Natl. Acad Sci. USA 91:11422; Zuckermann et al (1994). J. Med 
Chem. 37:2678; Cho etal (1993) Science 261:1303; Carrell etal. (1994) Angew. Chem. 
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Int. Ed. Engl 33:2059; Carell etal (1994) Angew. Chem. Int. Ed Engl 33:2061; and in 
Gallop etal. (1994) J. Med Chem. 37:1233. 

Libraries of compounds may be presented in solution {e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
5 1993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), plasmids 
(Cull et al, 1992, Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith, 
1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirlae/a/, 1990, 
Proc. Natl Acad. Sci. 87:6378-6382; Felici, 1991, J. Mol Biol. 222:301-310; Ladner, 
supra.). 

10 In one embodiment, the invention provides assays for screening candidate or test 

compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 

15 example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
marker compound in a complex. For example, compounds {e.g., marker substrates) can 
be labeled with 125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope 
detected by direct counting of radioemission or by scintillation counting. Alternatively, 

20 assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 

25 thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 
the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening 

30 is the use of the marker to identify its natural in vivo binding partners. There are many 
ways to accomplish this which are known to one skilled in the art. One example is the 
use of the marker protein as M bait protein" in a two-hybrid assay or three-hybrid assay 
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(see, e.g., U.S. Patent No. 5,283,317; Zervos etal, 1993, Cell 72:223-232; Madura et al, 
1993, J. Biol. Chem, 268:12046-12054; Bartel et al ,1993, Biotechniques 14:920-924; 
Iwabuchi et al, 1993 Oncogene 8:1693-1696; Brent WO94/10300) in order to identify 
other proteins which bind to or interact with the marker (binding partners) and, 
5 therefore, are possibly involved in the natural function of the marker. Such marker 
binding partners are also likely to be involved in the propagation of signals by the 
marker or downstream elements of a marker-mediated signaling pathway. Alternatively, 
such marker binding partners may also be found to be inhibitors of the marker. 

The two-hybrid system is based on the modular nature of most transcription 

10 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that encodes a 
marker protein fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 

1 5 fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a marker- 
• dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 

20 the transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 
the cloned gene which encodes the protein which interacts with the marker protein. 

In a further embodiment, assays may be devised through the use of the invention 
for the purpose of identifying compounds which modulate (e.g., affect either positively 

25 or negatively) interactions between a marker and its substrates and/or binding partners. 
Such compounds can include, but are not limited to, molecules such as antibodies, 
peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such 
compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 

30 in this embodiment is an ovarian cancer marker identified herein, the known binding 
partner and/or substrate of same, and the test compound. Test compounds can be 
supplied from any source. 
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The basic principle of the assay systems used to identify compounds that 
interfere with the interaction between the marker and its binding partner involves 
preparing a reaction mixture containing the marker and its binding partner under 
conditions and for a time sufficient to allow the two products to interact and bind, thus 
5 forming a complex. In order to test an agent for inhibitory activity, the reaction mixture 
is prepared in the presence and absence of the test compound. The test compound can 
be initially included in the reaction mixture, or can be added at a time subsequent to the 
addition of the marker and its binding partner. Control reaction mixtures are incubated 
without the test compound or with a placebo. The formation of any complexes between 

10 the marker and its binding partner is then detected. The formation of a complex in the 
control reaction, but less or no such formation in the reaction mixture containing the test 
compound, indicates that the compound interferes with the interaction of the marker and 
its binding partner. Conversely, the formation of more complex in the presence of 
compound than in the control reaction indicates that the compound may enhance 

1 5 interaction of the marker and its binding partner. 

The assay for compounds that interfere with the interaction of the marker with its 
binding partner may be conducted in a heterogeneous or homogeneous format. 
Heterogeneous assays involve anchoring either the marker or its binding partner onto a 
solid phase and detecting complexes anchored to the solid phase at the end of the 

20 reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can be varied to obtain different 
information about the compounds being tested. For example, test compounds that 
interfere with the interaction between the markers and the binding partners (e.g., by 
competition) can be identified by conducting the reaction in the presence of the test 

25 substance, i.e., by adding the test substance to the reaction mixture prior to or 

simultaneously with the marker and its interactive binding partner. Alternatively, test 
compounds that disrupt preformed complexes, e.g., compounds with higher binding 
constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 

30 various formats are briefly described below. 
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In a heterogeneous assay system, either the marker or its binding partner is 
anchored onto a solid surface or matrix, while the other corresponding non-anchored 
component may be labeled, either directly or indirectly. In practice, microtitre plates are 
often utilized for this approach. The anchored species can be immobilized by a number 
5 of methods, either non-covalent or covalent, that are typically well known to one who 
practices the art. Non-covalent attachment can often be accomplished simply by coating 
the solid surface with a solution of the marker or its binding partner and drying. 
Alternatively, an immobilized antibody specific for the assay component to be anchored 
can be used for this purpose. Such surfaces can often be prepared in advance and stored. 

10 In related embodiments, a fusion protein can be provided which adds a domain 

that allows one or both of the assay components to be anchored to a matrix. For 
example, glutathione-S-transferase/marker fusion proteins or glutathione-S- 
transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 

1 5 combined with the test compound or the test compound and either the non-adsorbed 
marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter p] ate wells are washed to remove any unbound assay components, the 
immobilized complex assessed either directly or indirectly, for example, as described 

20 above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a marker or a marker binding 
partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 

25 marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy- 

succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in 
advance and stored. 

30 In order to conduct the assay, the corresponding partner of the immobilized assay 

component is exposed to the coated surface with or without the test compound. After 
the reaction is complete, unreacted assay components are removed (e.g., by washing) 
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and any complexes formed will remain immobilized on the solid surface. The detection 
of complexes anchored on the solid surface can be accomplished in a number of ways. 
Where the non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the non- 
5 immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g. , using a labeled antibody specific for the 
initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compounds which modulate (inhibit or enhance) 

10 complex formation or which disrupt preformed complexes can be detected. 

In an alternate embodiment of the invention, a homogeneous assay may be used. 
This is typically a reaction, analogous to those mentioned above, which is conducted in 
a liquid phase in the presence or absence of the test compound. The formed complexes 
are then separated from unreacted components, and the amount of complex formed is 

15 determined. As mentioned for heterogeneous assay systems, the order of addition of 
reactants to the liquid phase can yield information about which test compounds 
modulate (inhibit or enhance) complex formation and which disrupt preformed 
complexes. 

In such a homogeneous assay, the reaction products may be separated from 
20 unreacted assay components by any of a number of standard techniques, including but 
not limited to: differential centrifugation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifugation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifugal steps, due to the 
different sedimentation equilibria of complexes based on their different sizes and 
25 densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 

Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
chromatography separates molecules based on size, and through the utilization of an 
appropriate gel filtration resin in a column format, for example, the relatively larger 
30 complex may be separated from the relatively smaller uncomplexed components. 

Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 
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the remaining individual reactants, for example through the use of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 1998, JMol Recognit. 1 1:141-148; Hage and 
Tweed, 1997, J. Chromatogr. B. Biomed. ScL AppL, 699:499-525). Gel electrophoresis 
5 may also be employed to separate complexed molecules from unbound species (see, 
e.g., Ausubel et al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, 
New York. 1999). In this technique, protein or nucleic acid complexes are separated 
based on size or charge, for example. In order to maintain the binding interaction during 
the electrophoretic process, nondenaturing gels in the absence of reducing agent are 

1 0 typically preferred, but conditions appropriate to the particular interactants will be well 
known to one skilled in the art. Immunoprecipitation is another common technique 
utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1 999). In this technique, all proteins binding to an antibody specific to one of the 

1 5 binding molecules are precipitated from solution by conjugating the antibody to a 
polymer bead that may be readily collected by centrifugation. The bound assay 
components are released from the beads (through a specific proteolysis event or other 
technique well known in the art which will not disturb the protein-protein interaction in 
the complex), and a second immunoprecipitation step is performed, this time utilizing 

20 antibodies specific for the correspondingly different interacting assay component. In this 
manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 
interactions between the marker and its binding partner. 

25 Also within the scope of the present invention are methods for direct detection of 

interactions between the marker and its natural binding partner and/or a test compound 
in a homogeneous or heterogeneous assay system without further sample manipulation. 
For example, the technique of fluorescence energy transfer may be utilized (see, e.g., 
Lakowicz etal, U.S. Patent No. 5,631,169; Stavrianopoulos et al, U.S. Patent No. 

30 4,868,1 03). Generally, this technique involves the addition of a fluorophore label on a 
first 'donor 5 molecule (e.g., marker or test compound) such that its emitted fluorescent 
energy will be absorbed by a fluorescent label on a second, 'acceptor 5 molecule (e.g., 
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marker or test compound), which in turn is able to fluoresce due to the absorbed energy. 
Alternately, the 'donor' protein molecule may simply utilize the natural fluorescent 
energy of tryptophan residues. Labels are chosen that emit different wavelengths of 
light, such that the 'acceptor' molecule label may be differentiated from that of the 
5 'donor'. Since the efficiency of energy transfer between the labels is related to the 
distance separating the molecules, spatial relationships between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in the assay should be maximal. An FET 
binding event can be conveniently measured through standard fluorometric detection 

1 0 means well known in the art (e.g., using a fluorimeter). A test substance which either 
enhances or hinders participation of one of the species in the preformed complex will 
result in the generation of a signal variant to that of background. In this way, test 
substances that modulate interactions between a marker and its binding partner can be 
identified in controlled assays. 

15 In another embodiment, modulators of marker expression are identified in a 

method wherein a cell is contacted with a candidate compound and the expression of 
mRNA or protein, corresponding to a marker in the cell, is determined. The level of 
expression of mRNA or protein in the presence of the candidate compound is compared 
to the level of expression of mRNA or protein in the absence of the candidate 

20 compound. The candidate compound can then be identified as a modulator of marker 
expression based on this comparison. For example, when expression of marker mRNA 
or protein is greater (statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
marker mRNA or protein expression. Conversely, when expression of marker mRNA 

25 or protein is less (statistically significantly less) in the presence of the candidate 

compound than in its absence, the candidate compound is identified as an inhibitor of 
marker mRNA or protein expression. The level of marker mRNA or protein expression 
in the cells can be determined by methods described herein for detecting marker mRNA 
or protein. 

30 In another aspect, the invention pertains to a combination of two or more of the 

assays described herein. For example, a modulating agent can be identified using a cell- 
based or a cell free assay, and the ability of the agent to modulate the activity of a 
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marker protein can be further confirmed in vivo, e.g., in a whole animal model for 
cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
5 agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., an marker modulating agent, an antisense 
marker nucleic acid molecule, an marker-specific antibody, or an marker-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side 
effects of treatment with such an agent. Alternatively, an agent identified as described 

10 herein can be used in an animal model to determine the mechanism of action of such an 
agent. Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herein. 

It is understood that appropriate doses of small molecule agents and protein or 
polypeptide agents depends upon a number of factors within the knowledge of the 

15 ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will 
vary, for example, depending upon the identity, size, and condition of the subject or 
sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to 
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small 

20 molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g. about 1 microgram per kilogram to about 500 milligrams per kilogram, 
about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 

25 subject or sample weight (e.g. about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 
about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 

30 can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g. a human) in order to modulate expression or 
activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
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researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 
5 body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and the degree 
of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 

10 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

15 antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 

20 enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

25 water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

30 example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 

polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
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of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
5 for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 

1 0 compound {e.g. , a polypeptide or antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which contains a basic dispersion medium, 
and then incorporating the required other ingredients from those enumerated above. In 

1 5 the case of sterile powders for the preparation of sterile injectable solutions, the 

preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 

20 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

25 Pharmaceutical^ compatible binding agents, and/or adjuvant materials can be 

included as part of the composition. The tablets, pills, capsules, troches, and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder 
such as macrocrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a 

30 lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

1 5 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

20 Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 

25 in the art, for example, as described in U.S. Patent No. 4,522,8 1 1 . 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 

30 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
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and the particular therapeutic effect to be achieved, and the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
(generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
5 mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 
enhance uptake and tissue penetration (e.g., into the ovarian epithelium). A method for 
1 0 lipidation of antibodies is described by Cruikshank et al (1 997) J. Acquired Immune 
Deficiency Syndromes and Human Retrovirology 14:193. 

The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by, for example, intravenous injection, local administration (U.S. 
15 Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et al, 1994, Proc. Natl. 
Acad, Sci. USA 91:3054-3057). The pharmaceutical preparation of the gene therapy 
vector can include the gene therapy vector in an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced intact from recombinant cells, 
20 e.g. retroviral vectors, the pharmaceutical preparation can include one or more cells 
which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

25 V. Predictive Medicine 

The present invention pertains to the field of predictive medicine in which 
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trails 
are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 

30 assays for determining the level of expression of polypeptides or nucleic acids 

corresponding to one or more markers of the invention, in order to determine whether 
an individual is at risk of developing ovarian cancer. Such assays can be used for 
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prognostic or predictive purposes to thereby prophylactically treat an individual prior to 
the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs or other compounds administered either to inhibit ovarian cancer or to treat 
5 or prevent any other disorder {i.e. in order to understand any ovarian carcinogenic 
effects that such treatment may have} ) on the expression or activity of a marker of the 
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

10 A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid corresponding to a marker of the invention in a biological sample involves 
obtaining a biological sample (e.g. an ovary-associated body fluid) from a test subject 
and contacting the biological sample with a compound or an agent capable of detecting 

15 the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection 
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic 
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in 
vitro techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a 

20 marker of the invention include enzyme linked immunosorbent assays (ELISAs), 

Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a polypeptide corresponding to a marker of the invention 
include introducing into a subject a labeled antibody directed against the polypeptide. 

25 For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind, 

30 thus forming a complex that can be removed and/or detected in the reaction mixture. 
These assays can be conducted in a variety of ways. 
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For example, one method to conduct such an assay would involve anchoring the 
marker or probe onto a solid phase support, also referred to as a substrate, and detecting 
target marker/probe complexes anchored on the solid phase at the end of the reaction. 
In one embodiment of such a method, a sample from a subject, which is to be assayed 
5 for presence and/or concentration of marker, can be anchored onto a carrier or solid 
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid 
1 0 phase. These include, without limitation, marker or probe molecules which are 

immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 
15 certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 
20 nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non- 
immobilized component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 
25 (e.g. , by washing) under conditions such that any complexes formed will remain 

immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished in a number of methods outlined herein. 

In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
30 directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 
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It is also possible to directly detect marker/probe complex formation without 
further manipulation or labeling of either component (marker or probe), for example by 
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et 
al, U.S. Patent No. 5,631,169; Stavrianopoulos, et al, U.S. Patent No. 4,868,103). A 
5 fluorophore label on the first, 'donor' molecule is selected such that, upon excitation 
with incident light of appropriate wavelength, its emitted fluorescent energy will be 
absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to 
fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule may 
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen 

10 that emit different wavelengths of light, such that the 'acceptor' molecule label may be 
differentiated from that of the 'donor'. Since the efficiency of energy transfer between 
the labels is related to the distance separating the molecules, spatial relationships 
between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay 

1 5 should be maximal. An FET binding event can be conveniently measured through 

standard fluorometric detection means well known in the art (e.g., using a fluorimeter). 

In another embodiment, determination of the ability of a probe to recognize a 
marker can be accomplished without labeling either assay component (probe or marker) 
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) 

20 (see, e.g., Sjolander, S. and Urbaniczky, C, 1991, Anal Chem. 63:2338-2345 and 
Szabo etal, 1995, Curr. Opin. Struct Biol. 5:699-705). As used herein, "BIA" or 
"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 

25 of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 
resulting in a detectable signal which can be used as an indication of real-time reactions 
between biological molecules. 

Alternatively, in another embodiment, analogous diagnostic and prognostic 
assays can be conducted with marker and probe as solutes in a liquid phase. In such an 

30 assay, the complexed marker and probe are separated from uncomplexed components by 
any of a number of standard techniques, including but not limited to: differential 
centrifugation, chromatography, electrophoresis and immunoprecipitation. In 
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differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Sci. 18(8):284-7). 
5 Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
filtration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the 

1 0 relatively different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol. Recognit Winter 1 1(1- 

15 6):141-8; Hage, D.S., and Tweed, S.A. J Chromatogr B Biomed Sci Appl 1 997 Oct 
10;699(l-2):499-525). Gel electrophoresis may also be employed to separate 
complexed assay components from unbound components (see, e.g., Ausubel et al, ed., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). 
In this technique, protein or nucleic acid complexes are separated based on size or 

20 charge, for example. In order to maintain the binding interaction during the 

electrophoretic process, non-denaturing gel matrix materials and conditions in the 
absence of reducing agent are typically preferred. Appropriate conditions to the 
particular assay and components thereof will be well known to one skilled in the art. 

In a particular embodiment, the level of mRNA corresponding to the marker can 

25 be determined both by in situ and by in vitro formats in a biological sample using 

methods known in the art. The term "biological sample" is intended to include tissues, 
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, 
cells and fluids present within a subject. Many expression detection methods use 
isolated RNA. For in vitro methods, any RNA isolation technique that does not select 

30 against the isolation of mRNA can be utilized for the purification of RNA from ovarian 
cells (see, e.g. , Ausubel et al. , ed., Current Protocols in Molecular Biology, John Wiley 
& Sons, New York 1987-1999). Additionally, large numbers of tissue samples can 
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readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No. 
4,843,155). 

The isolated mKNA can be used in hybridization or amplification assays that 
5 include, but are not limited to, Southern or Northern analyses, polymerase chain reaction 
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA 
levels involves contacting the isolated mKNA with a nucleic acid molecule (probe) that 
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe 
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide 

10 of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 

specifically hybridize under stringent conditions to a mRNA or genomic DNA encoding 
a marker of the present invention. Other suitable probes for use in the diagnostic assays 
of the invention are described herein. Hybridization of an mRNA with the probe 
indicates that the marker in question is being expressed. 

15 In one format, the mRNA is immobilized on a solid surface and contacted with a 

probe, for example by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the 
probe(s), for example, in an Affymetrix gene chip array. A skilled artisan can readily 

20 adapt known mRNA detection methods for use in detecting the level of mRNA encoded 
by the markers of the present invention. 

An alternative method for determining the level of mRNA corresponding to a 
marker of the present invention in a sample involves the process of nucleic acid 
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987, 

25 U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl Acad. Set 
USA, 88:189-193), self sustained sequence replication (Guatelli etal, l990,Proc. Natl 
Acad Set USA 87:1874-1878), transcriptional amplification system (Kwohefa/., 1989, 
Proc. Natl Acad Sci. USA 86:1 173-1 177), Q-Beta Replicase (Lizardi et al, 1988, 
Bio/Technology 6:1 197), rolling circle replication (Lizardi et al 9 U.S. Patent No. 

30 5,854,033) or any other nucleic acid amplification method, followed by the detection of 
the amplified molecules using techniques well known to those of skill in the art. These 
detection schemes are especially useful for the detection of nucleic acid molecules if 
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such molecules are present in very low numbers. As used herein, amplification primers 
are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3' regions 
of a gene (plus and minus strands, respectively, or vice-versa) and contain a short region 
in between. In general, amplification primers are from about 10 to 30 nucleotides in 
5 length and flank a region from about 50 to 200 nucleotides in length. Under appropriate 
conditions and with appropriate reagents, such primers permit the amplification of a 
nucleic acid molecule comprising the nucleotide sequence flanked by the primers. 

For in situ methods, mRNA does not need to be isolated from the ovarian cells 
prior to detection. In such methods, a cell or tissue sample is prepared/processed using 
10 known histological methods. The sample is then immobilized on a support, typically a 
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes 
the marker. 

As an alternative to making determinations based on the absolute expression 
level of the marker, determinations may be based on the normalized expression level of 

1 5 the marker. Expression levels are normalized by correcting the absolute expression 
level of a marker by comparing its expression to the expression of a gene that is not a 
marker, e.g., a. housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene, or epithelial cell- 
specific genes. This normalization allows the comparison of the expression level in one 

20 sample, e.g. , a patient sample, to another sample, e.g. , a non-ovarian cancer sample, or 
between samples from different sources. 

Alternatively, the expression level can be provided as a relative expression level. 
To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus cancer cell isolates, 

25 preferably 50 or more samples, prior to the determination of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the 
larger number of samples is determined and this is used as a baseline expression level 
for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 

30 that marker. This provides a relative expression level. 
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Preferably, the samples used in the baseline determination will be from ovarian 
cancer or from non-ovarian cancer cells of ovarian tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in 
normal tissues as a mean expression score aids in validating whether the marker assayed 
5 is ovarian specific (versus normal cells). In addition, as more data is accumulated, the 
mean expression value can be revised, providing improved relative expression values 
based on accumulated data. Expression data from ovarian cells provides a means for 
grading the severity of the ovarian cancer state. 

In another embodiment of the present invention, a polypeptide corresponding to 

10 a marker is detected. A preferred agent for detecting a polypeptide of the invention is an 
antibody capable of binding to a polypeptide corresponding to a marker of the invention, 
preferably an antibody with a detectable label Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g. , Fab or F(ab ! ) 2 ) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 

1 5 encompass direct labeling of the probe or antibody by coupling (i.e. , physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe 
or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 

20 detected with fluorescently labeled streptavidin. 

Proteins from ovarian cells can be isolated using techniques that are well known 
to those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 

25 York). 

A variety of formats can be employed to determine whether a sample contains a 
protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can 
30 readily adapt known protein/antibody detection methods for use in determining whether 
ovarian cells express a marker of the present invention. 
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In one format, antibodies, or antibody fragments, can be used in methods such as 
Western blots or immunofluorescence techniques to detect the expressed proteins. In 
such uses, it is generally preferable to immobilize either the antibody or proteins on a 
solid support. Suitable solid phase supports or carriers include any support capable of 
5 binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. 

One skilled in the art will know many other suitable carriers for binding antibody 
or antigen, and will be able to adapt such support for use with the present invention. For 

10 example, protein isolated from ovarian cells can be run on a polyacrylamide gel 

electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the 
detectably labeled antibody. The solid phase support can then be washed with the buffer 
a second time to remove unbound antibody. The amount of bound label on the solid 

1 5 support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g. an , 
ovary-associated body fluid such as a urine sample). Such kits can be used to determine 
if a subject is suffering from or is at increased risk of developing ovarian cancer. For 

20 example, the kit can comprise a labeled compound or agent capable of detecting a 
polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the 
invention in a biological sample and means for determining the amount of the 
polypeptide or mRNA in the sample (e.g., an antibody which binds the polypeptide or an 
oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide). Kits 

25 can also include instructions for interpreting the results obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 
marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 

30 For oligonucleotide-based kits, the kit can comprise, for example: (1) an 

oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2) 
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a pair of primers useful for amplifying a nucleic acid molecule corresponding to a 
marker of the invention. The kit can also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label (e.g., an enzyme or a substrate). The kit can 
5 also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
individual container and all of the various containers can be within a single package, 
along with instructions for interpreting the results of the assays performed using the kit. 



10 B. Pharmacogenomics 

Agents or modulators which have a stimulatory or inhibitory effect on expression 
of a marker of the invention can be administered to individuals to treat (prophylactically 
or therapeutically) ovarian cancer in the patient. In conjunction with such treatment, the 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype 

] 5 and that individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents (e.g., drugs) for prophylactic or therapeutic treatments 

20 based on a consideration of the individual's genotype. Such pharmacogenomics can 
further be used to determine appropriate dosages and therapeutic regimens. 
Accordingly, tlie level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

25 Pharmacogenomics deals with clinically significant variations in the response to 

drugs due to altered drug disposition and abnormal action in affected persons. See, e.g., 
Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body are referred to as "altered drug action." Genetic 

30 conditions transmitted as single factors altering the way the body acts on drugs are 

referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
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dehydrogenase (G6PD) deficiency is a common inherited enzymopathy in which the 
main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
5 major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 

10 These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 

15 experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 

20 ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 

25 identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 
and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 



30 



C. Monitoring Clinical Trials 

Monitoring the influence of agents (e.g., drug compounds) on the level of 
expression of a marker of the invention can be applied not only in basic drug screening, 
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but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving treatment for ovarian 
cancer. In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
5 antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) detecting the 

10 level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 
of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent can be desirable to increase expression of the 

15 marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent can be desirable to decrease . 
expression of the marker(s) to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

20 D. Electronic Apparatus Readable Media and Arrays 

Electronic apparatus readable media comprising a marker of the present 
invention is also provided. As used herein, "electronic apparatus readable media" refers 
to any suitable medium for storing, holding or containing data or information that can be 
read and accessed directly by an electronic apparatus. Such media can include, but are 

25 not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as compact disc; electronic storage media 
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of 
these categories such as magnetic/optical storage media. The medium is adapted or 
configured for having recorded thereon a marker of the present invention. 

30 As used herein, the term "electronic apparatus" is intended to include any 

suitable computing or processing apparatus or other device configured or adapted for 
storing data or information. Examples of electronic apparatus suitable for use with the 
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present invention include stand-alone computing apparatus; networks, including a local 
area network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; 
electronic appliances such as a personal digital assistants (PDAs), cellular phone, pager 
and the like; and local and distributed processing systems. 
5 As used herein, "recorded" refers to a process for storing or encoding 

information on the electronic apparatus readable medium. Those skilled in the art can 
readily adopt any of the presently known methods for recording information on known 
media to generate manufactures comprising the markers of the present invention. 
A variety of software programs and formats can be used to store the marker 

10 information of the present invention on the electronic apparatus readable medium. For 
example, the nucleic acid sequence corresponding to the markers can be represented in a 
word processing text file, formatted in commercially-available software such as 
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in 
a database application, such as DB2, Sybase, Oracle, or the like, as well as in other 

15 forms. Any number of dataprocessor structuring formats (e.g. , text file or database) may 
be employed in order to obtain or create a medium having recorded thereon the the , 
markers of the present invention. 

By providing the markers of the invention in readable form, one can routinely 
access the marker sequence information for a variety of purposes. For example, one 

20 skilled in the art can use the nucleotide or amino acid sequences of the present invention 
in readable form to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the sequences of the invention which match a particular 
target sequence or target motif. 

25 The present invention therefore provides a medium for holding instructions for 

performing a method for determining whether a subject has ovarian cancer or a pre- 
disposition to ovarian cancer, wherein the method comprises the steps of determining 
the presence or absence of a marker and based on the presence or absence of the marker, 
determining whether the subject has ovarian cancer or a pre-disposition to ovarian 

30 cancer and/or recommending a particular treatment for ovarian cancer or pre-ovarian 
cancer condition. 
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The present invention further provides in an electronic system and/or in a 
network, a method for determining whether a subject has ovarian cancer or a pre- 
disposition to ovarian cancer associated with a marker wherein the method comprises 
the steps of determining the presence or absence of the marker, and based on the 
5 presence or absence of the marker, determining whether the subject has ovarian cancer 
or a pre-disposition to ovarian cancer, and/or recommending a particular treatment for 
the ovarian cancer or pre-ovarian cancer condition. The method may further comprise 
the step of receiving phenotypic information associated with the subject and/or acquiring 
from a network phenotypic information associated with the subject. 

1 0 The present invention also provides in a network, a method for determining 

whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated 
with a marker, said method comprising the steps of receiving information associated 
with the marker receiving phenotypic information associated with the subject, acquiring 
information from the network corresponding to the marker and/or ovarian cancer, and 

1 5 based on one or more of the phenotypic information, the marker, and the acquired 

information, determining whether the subject has a ovarian cancer or a pre-disposition to . 
ovarian cancer. The method may further comprise the step of recommending a 
particular treatment for the ovarian cancer or pre-ovarian cancer condition. 

The present invention also provides a business method for determining whether a 

20 subject has ovarian cancer or a pre-disposition to ovarian cancer, said method 

comprising the steps of receiving information associated with the marker, receiving 
phenotypic information associated with the subject, acquiring information from the 
network corresponding to the marker and/or ovarian cancer, and based on one or more 
of the phenotypic information, the marker, and the acquired information, determining 

25 whether the subject has ovarian cancer or a pre-disposition to ovarian cancer. The 
method may further comprise the step of recommending a particular treatment for the 
ovarian cancer or pre-ovarian cancer condition. 

The invention also includes an array comprising a marker of the present 
invention. The array can be used to assay expression of one or more genes in the array. 

30 In one embodiment, the array can be used to assay gene expression in a tissue to 

ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes 
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can be simultaneously assayed for expression. This allows a profile to be developed 
showing a battery of genes specifically expressed in one or more tissues. 

In addition to such qualitative determination, the invention allows the 
quantitation of gene expression. Thus, not only tissue specificity, but also the level of 
5 expression of a battery of genes in the tissue is ascertainable. Thus, genes can be 
grouped on the basis of their tissue expression per se and level of expression in that 
tissue. This is useful, for example, in ascertaining the relationship of gene expression 
between or among tissues. Thus, one tissue can be perturbed and the effect on gene 
expression in a second tissue can be determined. In this context, the effect of one cell 

10 type on another cell type in response to a biological stimulus can be determined. Such a 
determination is useful, for example, to know the effect of cell-cell interaction at the 
level of gene expression. If an agent is administered therapeutically to treat one cell 
type but has an undesirable effect on another cell type, the invention provides an assay 
to determine the molecular basis of the undesirable effect and thus provides the 

1 5 opportunity to co-administer a counteracting agent or otherwise treat the undesired 
effect. Similarly, even within a single cell type, undesirable biological effects can be 
determined at the molecular level. Thus, the effects of an agent on expression of other 
than the target gene can be ascertained and counteracted. 

In another embodiment, the array can be used to monitor the time course o£ 

20 expression of one or more genes in the array. This can occur in various biological 

contexts, as disclosed herein, for example development of ovarian cancer, progression of 
ovarian cancer, and processes, such a cellular transformation associated with ovarian 
cancer. 

The array is also useful for ascertaining the effect of the expression of a gene on 
25 the expression of other genes in the same cell or in different cells. This provides, for 
example, for a selection of alternate molecular targets for therapeutic intervention if the 
ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more genes in normal and abnormal cells. This provides a battery of genes that could 
30 serve as a molecular target for diagnosis or therapeutic intervention. 
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E. Surrogate Markers 

The markers of the invention may serve as surrogate markers for one or more 
disorders or disease states or for conditions leading up to disease states, and in 
particular, ovarian cancer. As used herein, a "surrogate marker" is an objective 
5 biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or 
absence of a tumor). The presence or quantity of such markers is independent of the 
disease. Therefore, these markers may serve to indicate whether a particular course of 
treatment is effective in lessening a disease state or disorder. Surrogate markers are of 

10 particular use when the presence or extent of a disease state or disorder is difficult to 
assess through standard methodologies (e.g., early stage tumors), or when an assessment 
of disease progression is desired before a potentially dangerous clinical endpoint is 
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol 
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV 

15 RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes 
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate 
markers in the art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264; and 
James (1994) AIDS Treatment News Archive 209. 

The markers of the invention are also useful as pharmacodynamic markers. As 

20 used herein, a "pharmacodynamic marker" is an objective biochemical marker which 
correlates specifically with drug effects. The presence or quantity of a 
pharmacodynamic marker is not related to the disease state or disorder for which the 
drug is being administered; therefore, the presence or quantity of the marker is 
indicative of the presence or activity of the drug in a subject. For example, a 

25 pharmacodynamic marker may be indicative of the concentration of the drug in a 

biological tissue, in that the marker is either expressed or transcribed or not expressed or 
transcribed in that tissue in relationship to the level of the drug. In this fashion, the 
distribution or uptake of the drug may be monitored by the pharmacodynamic marker. 
Similarly, the presence or quantity of the pharmacodynamic marker may be related to 

30 the presence or quantity of the metabolic product of a drug, such that the presence or 
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo. 
Pharmacodynamic markers are of particular use in increasing the sensitivity of detection 
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of drug effects, particularly when the drug is administered in low doses. Since even a 
small amount of a drug may be sufficient to activate multiple rounds of marker 
transcription or expression, the amplified marker may be in a quantity which is more 
readily detectable than the drug itself. Also, the marker may be more easily detected 
5 due to the nature of the marker itself; for example, using the methods described herein, 
antibodies may be employed in an immune-based detection system for a protein marker, 
or marker-specific radiolabeled probes may be used to detect a mRNA marker. 
Furthermore, the use of a pharmacodynamic marker may offer mechanism-based 
prediction of risk due to drug treatment beyond the range of possible direct 

1 0 observations. Examples of the use of pharmacodynamic markers in the art include: 
Matsuda et al US 6,033,862; Hattis et al (1991) Env. Health Perspect. 90: 229-238; 
Schentag (1999) Am. J. Health-Syst Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) 
Am, J. Health-Syst Pharm. 56 Suppl. 3: S16-S20. 

The markers of the invention are also useful as pharmacogenomic markers. As 

1 5 used herein, a "pharmacogenomic marker" is an objective biochemical marker which 
correlates with a specific clinical drug response or susceptibility in a subject (see, e.g., 
McLeod et al (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of 
the pharmacogenomic marker is related to the predicted response of the subject to a 
specific drug or class of drugs prior to administration of the drug. By assessing thei 

20 presence or quantity of one or more pharmacogenomic markers in a subject, a drug :. 
therapy which is most appropriate for the subject, or which is predicted to have a greater 
degree of success, may be selected. For example, based on the presence or quantity of 
RNA or protein for specific tumor markers in a subject, a drug or course of treatment 
may be selected that is optimized for the treatment of the specific tumor likely to be 

25 present in the subject. Similarly, the presence or absence of a specific sequence 
mutation in marker DNA may correlate with drug response. The use of 
pharmacogenomic markers therefore permits the application of the most appropriate 
treatment for each subject without having to administer the therapy. 
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VL Experimental Protocol 

A. Subtracted Libraries 

Subtracted libraries are generated using a PCR based method that allows the 
isolation of clones expressed at higher levels in one population of mRNA (tester) 
5 compared to another population (driver). Both tester and driver mRNA populations are 
converted into cDNA by reverse transcription, and then PCR amplified using the 
SMART PCR kit from Clontech. Tester and driver cDNAs are then hybridized using 
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both 
subtraction and normalization, which is an equalization of copy number of low- 

10 abundance and high-abundance sequences. After generation of the subtractive libraries, 
a group of 96 or more clones from each library is tested to confirm differential 
expression by reverse Southern hybridization. 

To create the subtracted libraries, a first group of regular cDNA libraries was 
constructed. Library johOa was constructed from a pool of 5 normal ovarian epithelial 

15 cell cultures. Library johOb was constructed from a pool of 5 ascites short cultured 
samples from ovarian cancer patients. Library johOc was constructed from a pool of 6 
serous late stage (III/IV) tumor samples. Three subtracted libraries were generated from 
tumor samples. Library johOd was a subtracted ascites library, where the tester was 
johOb, and the driver was johOa. The johOe and the johOf library were both subtracted 

20 stage III/IV serous tumor libraries. The tester for both of these libraries was johOc, and 
the driver was a pooled RNA from normal tissues. The tissues used for this driver pool 
were: kidney, small intestine, prostate, lung, heart, muscle, spleen, pancreas, liver, and 
lymphocyte. Library cMhOg was the same as the johOc and johOf libraries, with the 
exception that normal ovary was added to the driver. cMhOh, i, j, and k are all stage I/II 

25 subtracted libraries made from pooled tumor RNAs of different histological types 

(h=serous, I-endometriod, j=clear cell, k==mucinous). The driver was the same for these 
4 libraries. It consisted of normal ovarian epithelial RNA and PBML RNA. Of the 
markers listed in Table 1, SEQ IDNOS: 1-129, 916-1029, 1566-1571 and 1607-1865 
were identified in library johOa. Markers identified in johOb include SEQ ID NOS: 

30 130-177, 1030-1081, 1572-1574, and 1866-1974. Markers identified in johOc include 
SEQ ID NOS: 178-269, 1082-1120, 1575-1577, and 1975-2060. Markers identified in 
johOd include SEQ ID NOS: 270-370, 1121-1304, 1578-1592, and 2061-2244. Markers 
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identified in johOe include SEQ ID NOS: 371-61 1, 1305-1416, 1593-1596 and 2245- 
2487. Markers identified in johOf include SEQ ID NOS: 612-915, 1417-1565, 1597- 
1606, and 2488-2871 . Of the markers listed in Table 1 A, SEQ ID NOS: 2872-2976, 
3817-3898, 4438-4443 and 4474-4675 were identified in library cMhOg. Markers 
5 identified in cMhOh include SEQ ID NOS: 2977-3376, 3899-4072, 4444-4455, and 
4676-5303. Markers identified in cMhOi include SEQ ID NOS: 3377-3495, 4073-4158, 
4456-4460, and 5304-5637. Markers identified in cMhOj include SEQ ID NOS: 3496- 
3742, 4195-4390, 4461-4468, and 5638-6197. Markers identified in cMhOk include 
SEQ ID NOS: 3743-3816, 4391-4437, 4469-4473 and 6198-6398. 

0 

VII. Summary Of The Data Provided In The Tables 

Tables 1, 1A, 2 and 3 are being filed concurrently herewith on a compact disc in 

lieu of paper copies. The compact disc submitted is formatted from an IBM-PC and is 

compatible with MS-Windows. The disc contains the following four (4) files: 
5 Tablel.text, containing l,223kb, TablelA.text, containing l,582kb, Table2.text, . 

containing 10,600kb, and Table3.text, containing 568kb. The material on the compact 

disc, namely Tables 1, 1A, 2 and 3, is expressly incorporated by reference. 

Tables 1 and 1A show 6398 novel nucleotide sequences. These 6398 novel 

sequences were determined to be novel through various BLAST searches of available 
0 databases. Of these novel markers, SEQ ID NOS: 1566 - 1606 and 4438-4473 are 

preferred, SEQ ID NOS: 916-1565 and 3817-4437 are more preferred, and SEQ ID 

NOS: 1 - 915 and 2872-3816 are most preferred. 

The sequences of Tables 1 and 1A were re-interpreted and vector sequences 

removed and those sequences are set forth in Table 2. 
5 Table 3 correlates the SEQ ID NOS. from Tables 1 and 1 A with those of Table 

2. 

The contents of all references, patents, published patent applications, and 
databases cited throughout this application are hereby incorporated by reference. 



Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
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described herein. Such equivalents are intended to be encompassed by the following 
claims. 

What is claimed is: 

5 
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Claims 

i 

1 . An isolated nucleic acid molecule comprising a nucleotide sequence of 
Tables 1-2, or a complement thereof. 

5 

2. A vector which contains the nucleic acid molecule of claim 1 . 

3 . A host cell which contains the nucleic acid molecule of claim 1 . 

10 4. An isolated polypeptide which is encoded by a nucleic acid molecule 

comprising a nucleotide sequence of Tables 1-2. 

5. An antibody which selectively binds to a polypeptide of claim 4. 

15 6. A method for producing a polypeptide comprising culturing the host cell 

of claim 3 under conditions in which the nucleic acid molecule is expressed. 

7. A method for detecting the presence of a polypeptide of claim 4 in a 
sample comprising: 

20 a) contacting the sample with a compound which selectively binds to 

the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
the sample to thereby detect the presence of a polypeptide of claim 4 in the 
sample. 

25 

8. A kit comprising a compound which selectively binds to the polypeptide 
of claim 4. 
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9. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 
5 b) determining whether the nucleic acid probe or primer binds to a 

nucleic acid molecule in the sample to thereby detect the presence of a nucleic 
acid molecule of claim 1 in the sample. 

10. The method of claim 9, wherein the sample comprises mRNA molecules 
1 0 and is contacted with a nucleic acid probe. 

1 1 . The method of claim 9, wherein the sample is isolated from ovarian 

tissue. 

15 12. The method of claim 9, wherein the sample is a tumor sample. 

13. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of claim 1 . 

20 14. A method of assessing whether a patient is afflicted with ovarian cancer, 

the method comprising comparing: 

a) the level of expression of a marker in a patient sample, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

25 b) the normal level of expression of the marker in a control non- 

ovarian cancer sample, 

wherein a significant difference between the level of expression of the marker in 
the patient sample and the normal level is an indication that the patient is afflicted with 
ovarian cancer. 

30 

1 5. The method of claim 14, wherein the marker corresponds to a secreted 

protein. 
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1 6. The method of claim 1 4, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 



1 7. The method of claim 14, wherein the sample comprises cells obtained 
5 from the patient. 

1 8. The method of claim 1 7, wherein the sample is an ovarian tissue sample. 

19. The method of claim 14, wherein the sample is an ovary-associated body 

10 fluid. 

20. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a protein or protein 
fragment corresponding to the marker. 

15 

2 1 . The method of claim 20, wherein the presence of the protein or protein 
fragment is detected using a reagent which specifically binds with the protein or protein 
fragment 

20 22. The method of claim 21 , wherein the reagent is selected from the group 

consisting of an antibody, an antibody derivative, and an antibody fragment. 

23. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a transcribed 
25 polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the 
marker. 



30 



24. 
mRNA. 



25. 

cDNA. 



The method of claim 23, wherein the transcribed polynucleotide is an 
The method of claim 23, wherein the transcribed polynucleotide is a 
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26. The method of claim 23, wherein the step of detecting further comprises 
amplifying the transcribed polynucleotide. 



27. The method of claim 14, wherein the level of expression of the marker in 
5 the sample is assessed by detecting the presence in the sample of a transcribed 
polynucleotide which anneals with the marker or anneals with a portion of a 
polynucleotide wherein the polynucleotide comprises the marker, under stringent 
hybridization conditions. 



10 28. The method of claim 14, wherein the level of expression of the marker in 

the sample differs from the normal level of expression of the marker in a patient not 
afflicted with ovarian cancer by a factor of at least about 2. 

29. The method of claim 14, wherein the level of expression of the marker in 
1 5 the sample differs from the normal level of expression of the marker in a patient not 

afflicted with ovarian cancer by a factor of at least about 5. 

30. The method of claim 14, comprising comparing: 

a) the level of expression in the sample of each of a plurality of 
20 markers independently selected from the markers listed in Tables 1 -2, and 

b) the normal level of expression of each of the plurality of markers 
in samples of the same type obtained from control humans not afflicted with 
ovarian cancer, 

wherein the level of expression of more than one of the markers is significantly 
25 altered, relative to the corresponding normal levels of expression of the markers, is an 
indication that the patient is afflicted with ovarian cancer. 



3 1 . The method of claim 30, wherein the level of expression of each of the 
markers is significantly altered, relative to the corresponding normal levels of 
30 expression of the markers, is an indication that the patient is afflicted with ovarian 
cancer. 
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The method of claim 30, wherein the plurality comprises at least three of 

33. The method of claim 30, wherein the plurality comprises at least five of 
5 the markers. 

34. A method for monitoring the progression of ovarian cancer in a patient, 
the method comprising: 

a) detecting in a patient sample at a first point in time, the expression 
10 of a marker, wherein the marker is selected from the group consisting of the 

markers listed in Tables 1-2; 

b) repeating step a) at a subsequent point in time; and 

c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of ovarian cancer. 

15 

35. The method of claim 34, wherein the marker corresponds to a secreted 

protein. 

36. The method of claim 34, wherein the marker corresponds to a transcribed 
20 polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. - 

37. The method of claim 34, wherein the sample comprises cells obtained 
from the patient. 

25 38. The method of claim 37, wherein the patient sample is an ovarian tissue 

sample. 
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the markers. 



39. The method of claim 34, wherein between the first point in time and the 
subsequent point in time, the patient has undergone surgery to remove ovarian tissue. 

30 
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40. A method of assessing the efficacy of a test compound for inhibiting 
ovarian cancer in a patient, the method comprising comparing: 

a) expression of a marker in a first sample obtained from the patient 
and exposed to the test compound, wherein the marker is selected from the group 

5 consisting of the markers listed in Tables 1-2, and 

b) expression of the marker in a second sample obtained from the 
patient, wherein the sample is not exposed to the test compound, 

wherein a significantly lower level of expression of the marker in the first 
sample, relative to the second sample, is an indication that the test compound is 
1 0 efficacious for inhibiting ovarian cancer in the patient. 

41 . The method of claim 40, wherein the first and second samples are 
portions of a single sample obtained from the patient. 

1 5 42. The method of claim 40, wherein the first and second samples are '. 

portions of pooled samples obtained from the patient. 

43. A method of assessing the efficacy of a therapy for inhibiting ovarian 
cancer in a patient, the method comprising comparing: 
20 a) expression of a marker in the first sample obtained from the 

patient prior to providing at least a portion of the therapy to the patient, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) expression of the marker in a second sample obtained from the 
25 patient following provision of the portion of the therapy, 

wherein a significantly lower level of expression of the marker in the second 
sample, relative to the first sample, is an indication that the therapy is efficacious for 
inhibiting ovarian cancer in the patient. 
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44. A method of selecting a composition for inhibiting ovarian cancer in a 
patient, the method comprising: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately exposing aliquots of the sample in the presence of a 
5 plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2; and 

d) selecting one of the test compositions which alters the level of 
10 expression of the marker in the aliquot containing that test composition, relative 

to other test compositions. 

45. A method of inhibiting ovarian cancer in a patient, the method 
comprising: 

1 5 a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein 
the marker is selected from the group consisting of the markers listed in Tables 

20 1-2; and 

d) administering to the patient at least one of the test compositions 
which alters the level of expression of the marker in the aliquot containing that 
test composition, relative to other test compositions. 

25 46. A kit for assessing whether a patient is afflicted with ovarian cancer, the 

kit comprising reagents for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-2. 

47. A kit for assessing the presence of ovarian cancer cells, the kit 
30 comprising a nucleic acid probe wherein the probe specifically binds with a transcribed 
polynucleotide corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-2. 
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48. A kit for assessing the suitability of each of a plurality of compounds for 
inhibiting ovarian cancer in a patient, the kit comprising: 

a) the plurality of compounds; and 

b) a reagent for assessing expression of a marker selected from the 
5 group consisting of the markers listed in Tables 1-2. 

49. A method of making an isolated hybridoma which produces an antibody 
useful for assessing whether a patient is afflicted with ovarian cancer, the method 
comprising: 

0 isolating a protein or protein fragment corresponding to a marker selected 

from the group consisting of the markers listed in Tables 1-2; 

immunizing a mammal using the isolated protein or protein fragment; 
isolating splenocytes from the immunized mammal; 
fusing the isolated splenocytes with an immortalized cell line to form 
5 hybridomas; and 

screening individual hybridomas for production of an antibody which 
specifically binds with the protein or protein fragment to isolate the hybridoma. 

50. An antibody produced by a hybridoma made by the method of claim 42. 

0 

51. A kit for assessing the presence of human ovarian cancer cells, the kit 
comprising an antibody, wherein the antibody specifically binds with a protein or 
protein fragment corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-2. 



52. A method of assessing the ovarian cell carcinogenic potential of a test 
compound, the method comprising: 

a) maintaining separate aliquots of ovarian cells in the presence and 
absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, 
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wherein a significantly altered level of expression of the marker in the aliquot 
maintained in the presence of the test compound, relative to the aliquot maintained in the 
absence of the test compound, is an indication that the test compound possesses human 
ovarian cell carcinogenic potential. 

5 

53. A kit for assessing the ovarian cell carcinogenic potential of a test 
compound, the kit comprising ovarian cells and a reagent for assessing expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 
Tables 1-2. 

10 

54. A method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer, the method comprising inhibiting expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

15 55. A method of treating a patient afflicted with ovarian cancer, the method 

comprising providing to cells of the patient an antisense oligonucleotide complementary . 
to a polynucleotide corresponding to a marker selected from the markers listed in Tables 
1-2. 

20 56. A method of inhibiting ovarian cancer in a patient at risk for developing 

ovarian cancer, the method comprising decreasing expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

57. A method for determining whether ovarian cancer has metastasized in a 
25 patient, the method comprising comparing: 

a) the level of expression of a marker in a patient smaple, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) the normal level or non-metastatic level of expression of the 
30 marker in a control sample 
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wherein a significant difference betweent he level of expression in the patient 
sample and the normal level or non-metastatic level is an indication that the ovarian 
cancer has mestastasized. 

5 58. The method of claim 57, wherein the marker corresponds to a secreted 

protein. 

59. The method of claim 57, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

10 

60. The method of claim 57, wherein the sample comprises cells obtained 
from the patient. 

6 1 . The method of claim 60, wherein the patient sample is an ovarian tissue 

15 sample. 

62. A method for assessing the aggressiveness or indolence of ovarian cancer 
comprising comparing: 

a) the level of expression of a marker in a sample, wherein at least 
20 one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control sample, 
wherein a significant difference between the level of expression in the sample 

and the normal level is an indication that the cancer is aggressive or indolent. 

25 63 . The method of claim 62, wherein the marker corresponds to a secreted 

protein. 

64. The method of claim 62, wherein marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

30 

65. The method of claim 62, wherein the sample comprises cells obtained 
from the patient. 
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66. The method of claim 65, wherein the patient sample is an ovarian tissue 
sample. 
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Sequence 1 

ACGCGTCCGGGAGACACAAAGCAGGAAGCCTCCGGGAGACCAGAGCTGGGTGCAGACATA 

CACACACACATACACACAGACACACAGAGTCACACACACTCACACACACTCTCTCTCTCT 

CTCCCTCTGTCTTTCTCTCTCTCTCTCTCTCTCTGTCCTTTCCTCCTGGACAGATCCACA 

GTTATACACAGAAACAAACACACACGCACGTAGAGAAGTGATTCACAAACACTTAAAGAC 

ATAAATCACAGGTGCAAAGCCATACCTGGGCTCAAAAACCTCCAAGAGAACGCAGCCTCA 

GACCCACCCAGGGGCCAGGGGCCAGGCTGTTTGCGAGACCAGCCCAGGCGGGACCCAGGC 

TTGCACGGGCAGGTACACGACATTCTTGGGCATACGCAGCCCGCCTGGCCGGAGCTGTGG 

GAGTCCTCAGCCCCAAGACCCAGCAGGCGTCTGAGGCCTGCCCACTAAGGAGGAGGAGTC 

ATTGCTGCCATCATTTATCATNCCCTCCCCAGCCACAGTCTGAGGAGGCCNTGTNCACCC 

TTCCCA 

Sequence 2 

CGTCCGGAAAGCTGGTGGGAATGCTAAGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTA 

CGATCCTTGCTGGTCCTTGCAGGAGAAGTACAACAGCAGAGCCGCGGCCCTCTTTAGGGA 

TAAGGTGGTCGCTCTGGCCGAAGGCAGAGAGTGGTCTTTGGAGTCATCACCTGCCCAGAA 

CTGGACCCTACCTNAGCCCANGACGCTGCCGTCCATGGTGCACCGAGTCTCTGGCCAGCC 

GCAGAGTGTGACCGCCTCCTNGGACAAGGCTTTTGAAGACTGGCTGAATGATGACCTCGG. 

CTCCTATCAAGGGGCCCAGGGGAATCGCTACGTGGGGTTTGGGAACACGCCACCGCCTCA 

NAAGAAAGAAGATGACTTTCTCAACAACGCCATGTCCTCCCTGTACTCGGGCTGGA 

Sequence3 

NCCACGCGTCCGGGACGCGGGCGCCAGGTGCACAGCCCCAGTCCGCTGCGGGCGGGCGTC 

GACATCTGCCGCGTGAGCGCGAGCTGGAGCTACACCGCTTTCGTGACCCGTGGAGGCCGC 

TTGGAGCTGTCGGGCTCAGCCAGCGGCGCGGCGGGCCGCTGCAAGGACGCGTGGGCCTCG 

GAGGGGCTCCTCGCGGTGCTGCGCGCCGGGCCGGGGCCGGAGGCGTTACTGCAGGTCTGG 

GCGGCCGAATCGGCGCTGCGTGGGGAGCCATTGTGGGCCCAGAATGGTGGTGCCCGAGGC 

cgaagggggaaggacgatccggccgggtgaaggcccaagcttggggaggctaccccttgc 
ttgccctgcggccccgtgcccctacgtgagcccccgcgggccgcccttctaccggccttt 
tggcttccgggagcttgcggggcaacgccaacttgggagctggggccgccagcaaccccg 
ttgcttgttggacgctgVgcccaaaggtggtttctggggggcgggg 

Sequence 4 

agtcnccacgcgtccgggaattgangccgcggggcgggcggcggcggcggctgggcggcg 

gccgggacccagcgggccaggtggggacggcgcggagcgggtgcgggagatgccgtgcgg 

gactggggccacctgagccgcccgcctcgtccccgccttctgtgggaaggatgtgcgcgc 

ggatggccggtcgcacaacagcggcccctcgggggccctacggcccctggctctgcctcc 

tggtggccctcgccctgggaccgtcgtgagagtggactgtggccaggctcccctggaccc 

tgtctacctgccggcagccctggagctcctagacgcccctgaacact 

Sequence 5 

TCNCCACGCGTCCGACTGTATGTATTCTGGATACAGGGGATACTGGGCTCGCTATGTGTG 

TGGAGCCATCCCTTCCTTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTC 

TGAGCTTCCTTTCCTGCTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTG 

CTACTTCGGGTCAGGTGGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACT 

TCAAGAGGAGAGGCTGCAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCCGTGTC 

ACTTTACTTTTGTACACAAGGGGGTCTCCTTAGTGCCCTCGAGAAGGGATTCTTGGGCeC 

TGAGCTTCTACTCCTGAGGCCACCTTCTGTGCAAGCCCCAAGCTCCCTCAACTCTAGGCT 

TGGAGTCCTCAGTGGGGAAAAGCCCTGNTTTGGGG 

Sequence 6 

CGTCCGCGTCCTCGCTGCGGAAAGTTGGGGCAACCTGTTGCTAGTCTGGTCGTTGGTGAC 
AGCGAGGCTTCCGCGCTCGCtGCTGGTGAGCAGCCCCGGCGTGCCCCGCGGGCTGGAAGA 
GGCGGCGGCGTGATGCGGCCCGTGGACGCGCCCCGCGCCGGCCCGCGAGAACCTGGCCTC 
CCTGGAGCGCGAGCGCGCCCGGGCGCACTGGCGGGCCCGCAGGAAGCTGCTGGAGATCCA 
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GAGCCTGCTCGACGCCATCAAGAGTGAGGTGGAGGCAGAGGAGCGGGGCGCCCGGGCCCC 

AGCACCCCGCCCCGCGTGCGGAGGCTGAGGAGNCGGGTGGCTCGGCTGTGCGCCCGAAGC 

AGAGAGGAAGGCTGCGGAAGCGGCGCGGATGGGGCAGGCGGGATCGTGGGAGCTGCACNN 

ACCGGATCGCCGGCTTGCGAGTGCTGCTGAGCCGGCGAGGCCCNCGGGTCTGGAAGCGGA 

ANCGCGCGGG 

Sequence 7 

NTTCGGGAGTCGACCACGCGTCCGCGCGCTGGAGGAGTGGAGCAAGCACCCGGCCCGGCC 

CTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAG 

GTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCG 

CGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGG 

CGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTTGCC 

GCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 

CCCCCTGGGGAAGGCGGCACCAGGGGAGCCTTGGGCGCCCNGGGCTTCGGCCGCGACCCC 

ATTTGGGGTAGACCACAAGAAAGCTTCGGGACCCTTTCGGCACCTTTGGACAGCCAAGAA • 

TGGCTGNTGGGCACCCTTTCTTCT 

Sequence 8 

CCCCGCGTCCGGAGCACGCAAAGGGAATAAATTGTAATTAGGTGGTGGGTGGCTAAAAAT 

GACAATGCAAAGGTGTTGGATTAAAAAAAAAATCTGGTAGTAGAGGGAAATTATGGAGGA 

I I I I I AAAAAGGTTAATGATAATATCCATCTACTTATGTAAC I I I I I I I GGAGAJACCTG 

ATAATAGTGTAGAGTGCATTGGAGAGGAAAAGTAGGAGTTGTAAAGACCATTTTGGATAA 

ACTTTGAAGCAAGGGATAATGGCCTCAACCAAGGTAGTGGTGCTGAAGATTGTTTACATA 

MTAAGCAGATACAMTAGAAGGGATTTTTCAAGTGGCATTGTAACTGCACTTTTCAAAG 

GTTATTTGCCAAAAATCAAATTAACGGTATCTTCAAAAATCATGTTTGATGGATGTATCA 

TCAAGGGCTTTCTTAAAATTTTGTTGAAAGCCAAGGAA 

Sequence 9 

CGCCTTCCCGGGAAGTTTGGAGGGCCCCGNAGGGGAAGCCCCCGCGNCTTCNGGGGGCCN 

GNCGGGCTTGGAAGGCAANCCCCACCCCAAGTTTCCCCGCCNANGGANTNCAATGAANCT 

TGACCGGGGCCCCCCGGAACCCCNCGCTNGNCTTNTTNGGGGGTGGTTCCTTGGGTTCCG 

GTGGGGGGGAACCCCAAGTGCTTTTTCAAGGCCCCGGCCGGGCGCGGGGCCCCGGAAAGG 

GCCTTTCAAGTTCTTNCCTTTCCCCGGNTTGAAAGAAGGNAAAAGGCCGGGAANGGAAAC 

CNGGGNAAAACCCGCNGGGNNGGGGCGGCCTTCNNCCGCCNGGGCGCCCCTTGCCNGGGG 

GGGGGNAAAGGGGNCAAAGTTTTCCCNGGGGCCCCCGGGGCCCCGGCNGNCCCTTTNAAN 

TCAAGGGGGCCGGGNNCGGNCTTTCCCCAANNCGGCCAAGTTCCTTCAAAGGGGCCCCCC 

CGGGNTTTGGGCCCGGNCCGGGNCNGNAACCTTGGGGAAGGAAAAAATTCAAAAAGTTTT 

GTTGGCCCGGTTTCCGGGTNGGAATTGCCCCCCNAAAATTTGNAAGCCGGGGGGGGGGNN 

NCCTTGGGGGCCCCTTCTTTGCCCCCNTTTAAAGGGAANGGGCNAAAACCTTTNCCCAAC 

CGNCCAANNGCCCCCGTNAAAAAANGGGCCGGCC7TNNTTTTGNCCGGGGCCCCNAANAA 

GGGCCTTTTCGN I I I I I ICGGGI I 1 1 I I CCCCCGGGCCGGGCGGGGCGGNNNCN I I 1 1 I I 

I I I I I GGCTTTTAAAGGGGGG 

Sequence 10 

NCGCGTCCGCCGCATTGTGGCCAAGTGCCATGAGGAGCAGCTGGATCATTCTGTCCAGTC 

ATATATTAAGTTCGTGTTCAAGACCAGGGCATGCAAGGAGAGGACTGTACATGAGGAACT 

GNCTAAAAATGTGACTGGTCTTTTGAAATCAAATGACTCAACAACAGTAAAGCATGTCCT 

AAAGCATTCCTGGTTCTTCTTTGCAATTATCCTAAAATCGATGGCACAGCACTTGATTGA 

CACAAATAAAATCCAGCTTCCCCGGCCTCAGAGATTTCCTGAATCTTACCAAAATGAATT 

GGACAATCTTGNCATGGTCCTATCCGACCATGTGATTTGGGAAATACAAGGATGCACTTG 

AAGAAACANGAAGGGCAAACCACAGCGTTGCCAGATTTCTCAAGCGCTGCTTTAC 

Sequence 11 

CGCCNCGCGTCCGGCTTCCTAGAAGAGCACAGTCCCTTAAAGCACCCTCTATTGCTACAA 

TTAAMGTCTAGCAGATTGTMCTTTAGTTACACAAGTTCTAGAGATGCTTTTGGCTATG 

CTACACTGAAAAGACTACAGCAACAAAGAATGCATCCATCCTTATCTCACTCTGAAGCTT 




WO 01/070979 



PCTYUS01/09126 



TABLE 1 
3/467 



TGGCATCTCCAGCAAAAGATGTGCTATTTACTGATACCATCACCATGAAGGCCAACAGTT 

TTGAGTCCAGATTAACACCAAGCAGGTTCATGAAAGCCTTAAGTTATGCATCATTAGATA 

AAGAAGATTTATTGAGTCCTATTAATCAAAATACCCTGCAACCGATCTTCCTCAGTGCGG 

GCCATGGNGTCCAGTGCCACATNGGGGGGGTCAGAATGATTACATTGGGCTTGCTCTCCC 

GGNGGATATAAATGATATATTTCANGGTMGGGTATTTCTTATTTTTAGACAAAAAACAT 

CCCCNCATGATGATCCAGGNGCCAGAGCATTTGCCCTGAATGCAGGAGGGCTTTCATNTG 

GNACTGGGNGGGCTTTGNAAAAAATTTTTT 

Sequence 12 

TTCGGGAGTCGACCACGCGTCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGA 

CGAGGGCCGGGTTCTGCGGCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAG 

CTTCTTCACCGACGGGTCCTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTC 

TAAGCGACACTCAACCTCTGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGG 

CTGGTTGTATTATAAGCAGATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACAT 

GCTGGGCTGGATCAGAGCGATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCCGGCT 

GTGCCAACCAAGCTCTTGATCAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGC 

TCTGGGCCCAAAGCTTGATTTCTTCCC 

Sequence 13 

GTATTAATGTTCTCAGGCATGAAGCAGAATTTTACGGGATCACTCCATTAGTAAGAAGGC 

TTCTCTTATGTGAAGAATTGGAGCGTTCCTCTTGTGGCAGTGTCCI I I I I CATGGTTACT 

TGCCCCCACCAGGTATTCCTAGTNCGTAAAATAAACAACACAGTCAGATCTGCTGATTCT 

AGGAATGGTCTAAATTCTACAGAAGGTGAAGCCCGGGGAAATGGTACACAGCCTGTTCTC 

TCTGGAACGGGAGAAGAAACTGTTAGGCTAGGATTTCCTGTGGATCCACNAAAGGTGCTA . 

ATAGTAGCTGGCCATCACAACTGGATTGTAGCTGCATATGCCCATTTTGCTGTGTGTTAC 

AGAATCAAAGAATNTTNANGATGGCAGCAAGTGNTTACGAGCCCATATTTGGATTGGACT 

ATCGAACGAGTAGNTTTAAATGCAAAGGTGGATGGAGGGCCACATGGAGACAAAAGACAA 

AAATG 

Sequence 14 

GCCNCGCGTCCGAAAAAATTAAGAGAAGGCCTGGCGGCCGGTCTGAAGTCATCTATAATT 

ATGTACAACGCCCCTTCATCCAGATGTCATGGGAAAAGGAAGAAGGGAAGAGTCGCCATG 

TGGATTTCTAGTGTGTTCGAAGCAAATCCCTCACGAATCTGGTAGCTGCTGGAGATGATG 

TCTTGGAGGACCAGGAGATATTAATGCATCACCCACCCCAAGTGGATGAACTTGACCGGC 

TAAATGCCCCACTTTCTCAGATGGCTTCTAACGACTTTCAGGATTAGGGCCAGCTGTGGG 

TCTACTCCTTGTTGGAGCCCATCTCACCTGGGATGCCTGCAGCCAGCCCTCCCTCGTGAT 

TTGTCTCACCTTGAGTAGGAGACATGCTTCTCCCCTAACCTTTTCCTTTCTGCCATAATT 

AACATATGTCCTTTTCAGTAAGTCCATGCCTCTGGCAGGGGATGAAAGAAGTACTCACTG 

GGTAATTAGCTACCATCTTTGCAGCACCCTGGTAACTTGAAAAATTT 

Sequence 15 

TCGGGAGTCGACCCCGCGTCCGCCGAGCGGGGCGGCGCGGCTGGCGGGGCCGGCGGCCGG 

CTGAAGCGAGAGCGCGACGCGACGCGACCGCGGCTTCCCGAGCTGCGCCTGGCCGNCCAG 

CGCCGCGGNCCGCCCGAGGCCTGGAGGGGTCCGGGCCGCCGTCCATGGTCGCGGCGTCCT 

GAGGCGGGGGACGCGCCCGGCGCCCCCGGCCCTCCTNCGCCTCCTCCCGCGGGGCGGGCG 

GCCTCCTCCGGCGCCTNCCCGCGCCCGCCCGCCGNTCGCCGCCGCCTCCCTCCCTCCTTC 

CCTGCGGCTCCCCCGGCTTTCGGAGCCCGGGGGCGGCCTGTGGCGCGCGGAGCCCGCGCC 

GGACTGCGCCTNTTTGGACCTTGAGGGGAMCATGCGTTTGCCNTGGATCGTTTGAAATT 

CTGAGTTT GGGAT CCCCGNCCGGCCGNCTGGCTTTTTCGCCCGCGGGI 1 1 1 1 ICCTTTTT 

TCCTTTTG C I I I I I 1 1 I CCTTCTT 

Sequence 16 

NGCCCCCGCGTCCGTTTTAATTATTTTGTTNGAGCCTGCANAGTAANGTTNTTAAAAATA 
TMCGTTCATACGCATTTTAATTAACTTTGAAAGTTCATATGCATCAGAAAATTTATGAA 
MTTTGMTGAAMMATTTCATCTATTTATTTTTCTAATTTTMTGGCAAATTTACACT 
ATTATGGCTGATAATTCTGTGAACTTACCTTCTTG7TGACTGATTC I 1 1 1 I CCCTTAATC 
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CCAGCTTTAAGGAGATAGGTGAAGTTATTGTACAAAGTTAAGTGATACCATAAAGTATAT 

ATTATAAAGTCATACATGGCTTTTGGACAGTNTTATATTTCAGTTGCAGTGCTGCATTCC 

ATTAAATTTCATAAAATGCTAGGGAAAATGTGTTTGATAAAATTTTNTGCAGTGAGAAAT 

GACAGACTGAGTGCCTGACAATTTAAGCCACATATGAAAGTATGCAAGTAAAGANTTCAG 

GTCCTTAATGTCATCTATATCATGGTATAAAAG 

Sequence 17 

CCACGCGTCCGGACGAGACGAGCCCACTAGTGTCCCCCGAGCGGGCCCAACCCCCGGACT 

ACACCTTCCCGTCGGGCTCGGGCGCTCACTTTCCGCAGGTGCCCGGGGGCGCGGTCCGAG 

TGGCTGGCGGCGGCCGGCTCGGGCCCCTNTCCGCCGGGCTCGCCGGGCCACGACCGCTGA 

GCGGCAGCCACTGTTGGATCGGGCCCGGGGCGCGGNGGCCCAGGGCCAGACCCAAACCGT 

GGCGGCGCAGGCCCAGGCTCTGGCCGTTCANGCCGNGGCGGCAGTCCACGCCGATCAGGC 

CCACCGNGAGCGGAACGAG 

Sequence 18 

GGGAGTCGACCNCGCGTCCGGGCGGTGGGTGTCCGCTTCTCTCTGCTCTTCGACTGCACC 

GCACTCGCGCGTGACCCTGACTCCCCCTAGTCAGCTCAGCGGTGCTGCCATGGCGTGGCG 

GCGGCGCGAAGCCGGCGTCGGGGCTCGCGGCGTGTTGGCTCTGGCGTTGCTCGCCCTGGC 

CCTGTGCGTGCCCGGGGCCCGGGGCCGGGCTCTCGAGTGGTTCTCGGCCGTGGTAAACAT 

CGAGTACGTGGACCCGCAGACCAACCTGACGGTGTGGAGCGTCTCGGAGAGTGGCCGCTT 

CGGCGACAGCTCGCCCAAGGAGGGCGCGCATGGCTTGGTGGGGCGTCCCGTGGGCGCCCG 

GCGGAGACCTCGAGGGCTGCGCGCCCGACACGCGCTTTTTTCGTGCCCGAGCCCCGCGGC 

CGAGGGGCCCGCGCCCTTGGGTCGCCCTGGTGGCTCGTGGGGGCTTGCACCTTTAAGGAC 

AAGGTGCTTGTGGCGGGCGCCGAGG 

Sequence 19 

NATGTNGNNCNAAAAAGGCCNGCNTTANAGGCCAGGAAACNCGTAAAAAGGGCNCGCGTT 

GCTGTGCGTCTTTTCCATAGGCTCGCGNCCCCCTGACCNAGTCATCATCAAAAAATCCGA 

CNGCTCAAGTCATGAGGTTGGCCGAAAACTCCGACAGGGACTTNTAANAGNATACCCANG 

GGCGNTTTCCCCCTGGGAAGGCTCCCTTCGTGGCGCNTCTCNNTGTTTCCAGACCCCTGC 

CCCGCTTTACNCGGNATTACCCTNGTCCCCGCCNTTTTCTTCCCTTTCGNGGAAAGCGGT 

NGGGCGCCTTCTCNTCAATTAGGCTTCACCGCCTGNTAANGGTATTCTCAAGTTNCGGNT 

GTANGGGTGCCGTTTTCGCTTCCAAAGNCTGGGGCCTTNTGTGCCACCGGAAACCCCCCC 

Sequence 20 

TTCGGGAGTCGACCNCGCGTCCGCCTGGAGCCGCCAGAGTTTCCGCACCCGGGAGGGAGA 

TGCGGCCGGGGCTCAGGCTCCTTGCAGTTGTAATTTAGATTCGAGAAGTGGTTTATCCTT 

TGACTGGAAAAGAAAAGTAGCTGCAGTATTCCCCCAGCACTTGCTGAGAGCATGCCGTAT 

GCCAGGCTGTGAGGCTCGAGAGACAAGCAGTGGAAGAGTTGCGGCCTGTTTCATCTCTGG . 

ATTGTAAATCTGAGCCTCCTTCTGGCCCCTGGAAGGGGACAGCATCACGATGGAATGATT 

CCTAACCAGCATAATGCTGGAGCCGGGAGCCACCAACCTGCAGTTTTCAGMTGGCCGTG . 

TTGGACACTGATTTGGATCACATTCTTCCATCTtCTGTTCTTCCTCCATTCTGGGCTAAG 

TTAGTAGTGGGATCGGTTGCCATTGTGTGTTTTGCACGCAGCTATGATGGAGACTTTGTC 

TTTGATGACTCAGAAGCTATTGTTAACAATAAGG 

Sequence 21 

CGACCACGCGTCCGGCAGCCGCGGGGCGGGCGGCGGCGGCGGCGGCGGCGGCCGGGACCC 

AGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGGGACTGGGGCC 

ACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGCGGATGGCCGG 

TCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCCTGGTGGCCCT 

CGCCCTGGACGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCCTGTCTACCTGCC 

GGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACTTCCGTGTGCAGCAGGTGGGCCACTA 

CCCACCTGCCAACTCCTCTCTGAGCTCCCGATCTGAGACCTTTCTGCTCCTACAGCCCTG 

GCCCAAGGCCCAGCCACTTCTTCGGGCCTTCTACCCAA 

Sequence 22 
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CGCGTCCGCCCGGTGCCTCCGCCCATGGAACGCGCGGAGTGGCGCCGCCGCGGCTACGCG 

CCGCTGCTCTATCTGCAGTCACACTGCGACGTGCCAGCGGACCGGGACCGCTACGTGCGC 

GAGCTCATGCGCCACATCCCGGTAGACTCCTACGGGAAATGCCTGCAGAATCGGGAGCTG 

CCTACCGCGCGGCTACAGGACACAGCCACGGCCACCACCGAGGATCCAGAGCTCTTGGCT 

TTCTTGTCCCGCTATAAGTTCCACTTGGCCCTGGAAAATGCCATCTGTAACGACTACATG 

ACAGAAAAACTGTGGCGTCCCATGCACCTGGGCCGCTGTGCCCGTGTACCGCGGTTCTCC 

CTCTGTGAGGGACTGGATGCCGAACAATCACTCCGTCATCCTGATTGATGATTTTGAGTC 

TCCTCAGAAGCTGGCAGAGTTTATTGACTTTCTGGACAAGAATGATGAGGAGTATATGAA 

ATACCTGGCATACAAGCAACCT 

Sequence 23 

CGCGTCCGGCTGGGCGAATNAGGGATTCCGGTTCACAATGGATGCTGATAAAGAGAAAGA 

TTTGCAGAAATTTCTTAAAAATGTGGATGAAATCTCCAATTTAATTCAGGAGATGAATTC 

TGATGACCCAGTTGTGCAACAGAAAGCTGTCCTGGAGACAGAAAAGAGACTACTGCTTAT 

GGAGGAAGACCAGGAGGAGGATGAATGCAGGACCACCTTGAACAAGACTATGATCAGTCC 

TCCACAAACTGCTCTGAAGAGTGCAGAAGAAATAAACTCAGAGGCCTTCTTGGCATCTGT 

GGAGAAGGATGCAAAGGAACGAGCCAAGAGAAGAAGGGAAAACAAAGTCTTGGCGGATGC 

CCTAAAAGAAAAAGGGAATGAAGCATTTGCTGAAGGCAATTATGAAACAGCTATCCTGCG 

CTACAGTGAGGGGTTTGGAGAAGCTGAAGGACATGAAAGTGCTGTACACCAACCGAGCCC 

AGGCTTATATGAAACTTGAGGA 

Sequence 24 

GGGAGTCGACCNCGCGTCCGCTCCCTCTGAGTTGCGCTGGGCTTGGCTGCTGCACCATGA 

CCCTGGAGGCGATCCGCTACTCGCGGGGCTCCCTGCAGATCCTAGACCAGCTGCTGCTGC 

CCAAGCAGAGCCGCTACGAGGCGGTGGGCTCGGTGCACCAGGCCTGGGAGGCCATCCGCG 

CCATGAAGGTGCGGGGCGCCCCGGCCATAGCCCTGGTGGGCTGTCTCAGCCTCGCCGTGG . 

AGCTGCAGGCGGGCGCCGGGGGACCGGGACTCGCCGCGCTCGTGGCCTTCGTGCGCGACA 

AGCTGAGCTTCCTCGTCACCGCCCGGCCCACCGCTGTCAACATGGCCCGCGCCGCCCGCG 

ACCTGGCTTGATGTTGCAGCCCGGGAGGCCGAACGGGAGGGGCGCTACGGAAGAGGCCGG 

TCCGGGAGAGAGTGATCTGCTGCACCGAGGACATGCTGGAGAAAGACCTCAGAGACAACC 
GAAGCATTG 

Sequence 25 

GGAGTCGACCNCGCGTCCGGGATAACGAAGCTGCTACCATGATGATGGCTGATCTCATGT 

TCAGAAAACAAGACTATGAACAAGCAGTGTTTCATTTACAGCAGCTTTTAGAACGTAAGC 

CAGACTCCTCGAGTTCAGGGATCACACCATATTCCCAATATCAGACAAAATGCCACACAC 

ATGGATGTGGCAACATAGATGTTTATTGGTTGAATGGATCAGTGAATGACTGTAAAACAC 

CAAGTCAATTAAAAACACAGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCCGTG 

AGCCAAGATCACACCACTGCACTCCAGCCTCGGTGACAGAGTGAGACTTGGTCTCAAAAA 

CMCCACAAMTTTTAAATAATTATATGACATTATCTCGTTTGATTGATCTCCTAAGAAG 

ATGTGGAAAACTCGA 

Sequence 26 

ACCGTCCGGGGCCATC CAGGA GAGCCTCCTACCAGCACAGAAGGCCTGTGCCCCAGCGCC 

CTGAGCGAGACAAGCCGTTTTGATAATGACTTGCAGCTAGCCATGGAGCTCTCTGCCAAA 

GAGCTGGAGGAATGGGAGCTCCGGCTCCAGGAGGAAGAGGCTGAGCTCCAGCAAGTCTTA 

CAGCTGTCACTCACTGACAAATAGACCTTTCAGCCTGTGAGCCTCTGCACAAAGCAGAGG 

CTGTGGGCTGTCACAGATGCTGTGTCAACCAGGGCCCTAGGGCTAAGGGCCTGCACCTTG 

CGTGCATGCAGCAGGCAACAACTGCCCCTTCTTTATGCAGAGGTGCAGAACCAGGGACTC 

CTGGGCCCATCCAGGCTGCTTCCTTGGGGTGG 

Sequence 27 

NCCNCGCGTCCGGCCGGCGATGCCGCGCCCCCGGGCCGGGCTGTAGCGGGGCCGCGGCTG 
GACGTGTGCGCCGGGCAGGCGGGACATGGAGGTGGTGGACGAGACGGAGGCGCTGCAGCG 
CTTCTTCGAAGGCCACGACATCAACGGTGCCCTGGAGCCCTCCAACATAGACACCAGCAT 
CCTGGAGGAGTACATCAGCAAGGAGGATGCCTCCGACCTCACACTGCCGGACTCTCCCCC 
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AGACTCGGGCTCCGAGGCCTACTCCCCCCAGCAGGTGAATGAGCCCCACCTNCTGCGCAC 

GATAACCCCTGAGACACTGTGCCACGTGGGGAGTGCCCTTC 

Sequence 28 

CGCGTCCGCNAGGGAGGGCCGAGCGAGGCGCAGGCAACCGGGCAGCAGGCATGATGGCCT 

CGCCTAGTGACTCCAGCCGCTCGCTGACCAGCCGGCCCAGCACCAGGGGCCTTACCCACC 

TNCGCCTNCACCGACCCTGGCTGCAGGCCCTGCTTACGCTGGGGCTGGTCCAAGTGCTCC 

TGGGCATACTGGTGGTCACCTTCAGCATGGTGGCCTCTTCCGTCACCACCACCGAGAGCA 

TNAAGAGGTCCTGCCCGTCTTGGG 

Sequence 29 

ACGCGTCCGATTTTAGTNGCAAGGAGTCCATGTGTTCAACTCCAGCATTTCCTGTGTCTC 

CAGAGACACCGTATGTGAAAACAGCGCTGCGCCATCCTCCGTTCAGCCCACCTGAGCCCC 

CGCTGAGCAGCCCAGCCAGTCAGCACAAAGGAGGACGTGAACCACGAAGCTGCCCTGAGA 

CGCTCACTCACGCTGTGGGGATGTCAGAGAGCCCCATCGGACCCAAATCCACGATGCTCC 

GGGCTGATGCGTCCTCGACGCCCTCCTTTCAGCAGGCTTTTGCTTCTTCCTGCACCATTT 

CCAGCAACGGCCCTGGGCAGAGGAGAGAGAGCTCCTTCTTCTGCAGAACGCCAGTGGGTG 

GAGAGCAGNCCCAAGCCCATGGGTTTCCCTGCTGG 

Sequence 30 

CCGGTTNCTTGTNNGTTNATTGATTTAAAATAGAATATCAATTGAATTTAGAAAATTCTC 

AAMGCCAGTTTAATGCTGTTCATCTTTTAAGGCCAAAAAAAGTTTAATCCAGAGGCAGT 

CTTTCATTCTGCACTMTTTATAATTTAGATCAMGAACTMTTATATATCTCAMTTTA 

ATAATAAAAAGGTATAGTAATGAGAATTAAAT7TATGGTAAATTATATAACTCAGAATGT 

TAAAGTAACTTGGAAATTCTTAATCTAAGTTAAGTATC I I I I I ATTTCTTACTTGTCCTG 

TTTGATTTATTAAGGGAMAGMMTTTTAAGGAGTTGCCAGTATTTCTTTGTC 

AAGTGGAATGTTTATTCACCCTTATTTATATACTTAAAAGACATTGTATTGGCCTGGTCT 

CGAACTCCTGACCTCAAAGGTGGATCCACCCACCTCGGCTTCCCAAAGTGCTGGGATTA 

Sequence 31 

GCCGGTGATTTTGAACMTTCTGAAATATTTCAGGTMGATTAATAACATCCAATTACAA 

ATATATGTTTCAATATTTTATACGTATGTCTACTTTGAAAGTTAAACCAATAGTATAGAA 

AGCCTAAGAATGAACACTGATTGGACATACTCACAGAAATTAAGGGAAAAACACATATTG 

TAAAATTCCTGTCMTGTTTGAGTAGMTACAGAAGTACATAGCAGTCTTCMTTTTTAA 

ACACAATTATGGGCTTATMCTGGACGTGACATGCATCATTTATTAGAACAATATTATTT 

ATTTATACTAAGTAAGGATATAAGATCACAGAAGCTTAGTGTTATAACGGAGACTTCACA 

GACATTCATAACTAATGTTTTCTAAGGCAAATAAGGGGCATAAACCAGAACTCATGGGTC 

AGTGCCAGAGGTMGTATAAAMGGTTATGTATGAMGACATTTATTTATAGGAGAATTT 

CTGAGGGATTCTATGCCTTTTCAACTTA 

Sequence 32 

NCGTCCGGGAAACTGGTTCNGATGGTGTCTGCCCAGGAGCGCCTGACACGCACCTTCACA 

CGCAGCAGCCACACCTACACCCGCACGGAGCGCACGGAGATCAGCAAGACGCGGGGCGGG 

GAGACAAAGCGCGAGGTGCGGGTGGAGGAGTCCACCCAGGTCGGCGGGGACCCCTTCCCT 

GCTGTGTTTGGGGACTTCCTGGGCCGGGAGCGCCTGGGATCCTTCGGCAGCATCACCCGG 

CAGCAGGAGGGTGAGGCCAGCTCTCAGGACATGACTGCACAGGTGACCAGCCCATCGGGC 

AAGGTGGAAGCCGCAGAGATCGTNGAGGGCGAGGACAGCGCCTACAGCGTGCGCTTTGTG 

CCCCAGGAAATGGGGCCCCATACGGTCGCTGTCAAGTACCGTGGCCNGCACGTGCCCGGC 

AGCCCCTTTCAGTTTACTGTGGGGCCGCTNGGTNGAAAGGTGGTGCCCACAAGGTGCGGG 

CCCGGAGGCAC 

Sequence 33 

CCGCGTCCGCAGGMATTGTTAAAAATAATTTGGGGGTGTTTATTGGGGAAGGAAACAGG 

GCCTTGACAGTGGAGGACTTGGAAGACATGTAATTTAAGATATAGAGTATGATTGTTGGA 

AMTMGCAT GGAGA TCCAGMGGAATCTTAAGAGTTTTTCTATGCAAGTGAAGATGGAA 

GAAAATATGTATTTTACAMAGATAAATTACAAGTACCTTATTTGCTTTGCAAAATAACT 

TATCATGTTCTTCCACTATTTTTATTATATTTTAATTTTAATGAAACTTATATMCATT^ 
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ACACTAMTTTTAMTACATGGCTCMGACAAAAAAATGGGGAAMTATTTTTTAAAAAA 
TCACCCCAAATCCTGGTACTCAGACATAACCACTATTCAAACTTGGCAGAGTAATATTTT 
TCCTGTCTGCATGTGTGCCACNATGTGTGCATGCATACCACAAAGTAGATGTTTTACTAT 
ATCTCCTGGGTTATTATCTGC I I 1 1 I ICCC 
Sequence 34 

ACGCGTCCGGGACAAGAGAAACATAAGCNGGAAAAAAGGAGGAGGAATAAACACACGCCT 

GTCCATAATAAAACTCGCTCTTGAAGACTCAGCGGCAGCCCTGCACCGGAGACTGACGAC 

TTGCGCGGCTGTGACCTCCGCCCTGCAGCGGACCCTCGACTGCCCTGCACTGCGGCTCTG 

GAGGCCCCGACTCAGTGCATGGGAAAGAAATCCTCACTATCAGAAAACAGAGGGGCAATC 

TGCTGCTCTCCCTTTCCGGCCAAACACGTCACCCATCAACCGGATACCTACCAAGAGGCT 

TTCAGAGGAGGCGCCCAAGGTCTCCCAGGCCCGCCCCTCCCCAATCACGCTCCGCTCAGC 

CCCCTCAACTTTTGGCCTCCGGGAAGTTCGCAGCGTNTCTCACGCTTGGCAGGAAGTTCC 

CGCCAAGGCTTTCGGAAAATCCTTTAAAAAGCAACGCTTGCGCTGGGCGGGGCTTTGGTG 

Sequence 35 

CCCCGCGTCCGGTAGATTGCTTGTGGCTGGCAGTGAAGATGGTGGAGTTCAACTTTTTGA 

TATAAGTGGGAGGGCTCCCCTCAGGCAGTTTGAAGGCCATACAAAGTAAGAGACAGTTGG 

TTTCTGTGTGTTCTGGTTTTATTTTGTTGTAAGCTC I 1 1 I 1 1 I CTCTGGACTTTGGTTAA 

AMGATAGAGATCAGTTTTATGGAGATTATTTGCCTATAGGTACTATATTTCCTGATTGT 

TCTAAGAGTGCTTAACTTGGGTTCCGTGGTGCAGTTTCATGGGGCTTATGAATTCCCTAG 

MTTGTATGTGATATTTTAGGAAATACACGTTTATCTAGGGAGCTACTCTGTAGCTTTTG 

GTTAACTTTAGTGGGGTCTGTGGCCCAGCTGAGATTATGAATTACTGACCTGAAGACAAC 

CTTACAGCTGGTAATGACAGCTCTATAGGCCTGTACTGTCTTAGAGGCTCTTATGTTGAA 

■GTCAAGTANGAAGGTGGATTTTCTTCTTGAATTATAGTGTTTTGCCCCTTAATAA 

Sequence 36 

CNCGCGTCCGCGGACGCGTGGGGGCGAGGGCCGCTGGGGCCGCGAAGTGGGGCGGCCGGG 
TGGGGTACGAGCCGGGTCTGGGCTGAGGGGCGCGGCTTCGCGGTGGACCCCAGCCCGGCA 
ACGGGAAGGCGAGCTCTCCTCCACCGTCCAAAGTAAACTTTGCCGCTCCTTCCGCGGCGC 
TCCCGAGTCCTCGCCGCCGCCGGGCCGCCGCAGTCCGCGAAGAGCCGTCCTGCGTCAGGG 
'CCTCCTTCCCTGCCCCGGCGCGGGGCCACTGCGCCATGGACGCCACAGCACTGGAGCGGG 
ACGCTGTGCAGTTCGCCCGTCTGGCGGTTCAGCGCGACCACGAAGGCCGCTACTCCGAGG 
CGGTGTTTTATTACAAGGAAGCTGCACAAGCCTTAATTTATGCTGAGATGGCAGGATCAA 
GCCTAGAAAATATTCAAGAAAAA 
Sequence 37 

GCGTCCGCGGACGCGTGGGCCGCCGCCGCCGCCGCCGCCGCGATGTGACCTTCAGGGCCG 

CCAGGACGGGATGACCNGGAGCCTCCGCCCCGCGGNGCCCGNGGCTCGCCTCGGCCTCCC 

GGGCGCTCTGACCGCGCGTTACCCGGCCCGCCATGGCCCCCTTCTCTCTNGCCCGGGCNC 

ACGCTCGACCCTGCGCNCGTATCGCCAGCAAGTCTGCTGCCTGCATGCTGCNTGGCTCGC 

GAGNNGTTGGCGCGCNTTGCNCTGNNGCCTGTTTGCACGAGCNGCCGCCGAGGCCCACTG 

CANGNTTANCTCGACGGTCTCCNAAGNCANACCGCNCCGCCGNCCCTTTCCAAGGNTCNT 

TTCNCGAGGGGANGGGCNCAAANGGCCAGGGGNCCCCACNCNTGGGGMGAAGGGNGGAG 

TTNGNCNGATGNGNATGAGNAANCCTTGTAGNCAAGANCCCGTCCCNAAGNGGNAGNGNC 

AGCCCNGGGGAAGGGTTGGTTTTCNNANGNTAAACCGNAACCCCCCCGGAAGTNAACCGG 

NGATTTTMTTTATTTTTACCCCNCANANGNAAATAACCTTTNAANAACCATGGGNCATT 

NNAAANCAAAAGNTAAATTNGGGGGNAACTTAACCGNTAAACCANCCCAAAA 

Sequence 38 

CNCGCGTCCGGCGGGTCCCGCCGGCGGGTACCTGGGCACTGCGCCCCATCTGGACTGAAA 

TGGGGACACCCCTTCGGGGGTCCCAGGCTCCTGGCCGTATTGTTCTCCTTCTCCTCGTGA 

TAACTCCGCAGTGGAGGTGGATTCCGTCCAAGACGCCCAACGTGGCTCCGCGTAGCAATC 

AGCGCTGCAATCCTGGCGGTTACCTCAGCGGCGGCGTCTCTCTCTGCGCCTCACACTCGC 

AGCCCGCGGCCCTCCCCAACTTAGGGCGTTTACAAAAGAAACTACTCCAGACGCGCTGCA 
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AAGGGAGGCGCATGTGCCCGAAAGCTGGCGATCAGACGGGGGGGGGCATTCTGCATGTGT 

GATGTTTCTGGGGGCGGTGGGGAGTGTGTGTCGGGGTCGGGGGGCGGG 

Sequence 39 

NTGGTCTTTCTTCCTAAGAAGGAGATGGAGCCAATTTCTCACAGCTCGTGCCTCAGTACT 

GAGGGTATGGAGGAAAAGGCAGTCAGTCAGTGTCTAAAAATGACGCACGCAAGAGACGCT 

CGGGGAAGATGTAGCTGGACCTCTGAGATTTACTTTTCTCAAGGTGGACAAGCTGTAGCC 

ATCGGGCAATTTAAAGATCGAATTACAGGGTCCAACGATCCAGGTAATGCATCTATCACT 

ATCTCGCATATGCAGCCAGCAGACAGTGGAATTTACATCTGCGATGTTAACAACCCCCCA 

GACTTTCTCGGCCAAAACCAAGGCATCCTCAACGTCAGTGTGTTAGTGAAACCTTCTAAG 

CCCCTTTGTAGCGTTCAAGGAAGACCAGAAACTGGCCACACTATTTCCCTTTCCTGTCTC 

TCTGCGCTTGGAACACCTTCCCTGTGTACTACTGGCATAAACTTGAGGGAAGAGACATCG 

TGCCAGTGNAAGAAAACTTCAACCCAACCACCGGGAT 

Sequence 40 

CACCGNNCGCGCTCCTTCTGCCGCCAGGGCGAGGCTGGCACCCGGCCAGCGCGGGCAGGG 

CCACGGGTGCCCGGCTGTTTCCCGGTTGTGGAAGGCGCTCAAGGTGCGCGGCCCGGGGCG 

CGCTACTGGGGGCGCCCTCCGCGGTGGGCAAGCGCGCCAGGGATCGGCCTGGGCANGCCG 

CGGGGCGCGCGAANGCTGCGCTTTCCCTACGCCCCCCCTCGCTTCCTCCGGCACGGCGGC 

AACGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTTCGGGGATCCTCGCCCCGCCC 

CAATTTCTNCGCCCCCTCCCCTTTGCTGGGGCGCCTGGGCTGGCCCGCGCAGGGGA 

Sequence 41 

CNCGTCCGGTTCCTAACACAGACGAACTCAGC7TCCTTTGCCATGCCTCTGACTCGAGCC 

AGCCTTTCTTTTTATCCTCCG I I 1 1 I CTCAGATTCCCTCCACACAGTCTTTTCCCAGGTC 

TAGATCGCTTCCCTCGCCCCAATCTTTCCTTGAACCCCTTTTCCAGTGTCCCAAAGCTGT 

CCACTCTTACGCCTCTTCCAGAAACACAGGCTACCTCCCCCCAATTCCCCAGTGCCACTC 

TGGATTGTAATATCCCACTCAGGAGCTTCTTTCTTGTAATTTCCCTCCCCCCACCCCCCA 

GCGTCCCCGGGTCGTCTGTTTTCTTCTATGAAGCAAATATTACTCATCAAATATAGGAAC 

AAAGGCCTAAGTCCTTTCTGNGCTTTATTTCTrNGGGTGACTGGATCTTAGATCCTATCA 

TTTAAGTAGATGATGGTT 

Sequence 42 

GGTGTCGACCNCGCGTCCGCGCTTCTCNCCTCGGCCCGTGGAGCCGGGGCGTCCGGGCGT 

AGCCCTCGCTCGCCTGGGTCAGGGGGTGCGCGTCGGGGGAGGCAGAAGCCATGGATCCCG 

GGCAGCAGCCGCCGCCTCAACCGGCCCCCCAGGGCCAAGGGCAGGCCGCCTTCGCAGCCC 

CCGCAGGGGCAGGGCCCGCCGTCCGGACCCGGGCAACCGGCACCCGCGGCGACCCAGGGG 

GCGCCGCAGGCACCCCCCGCCGGGCATCAGATCGTGCACGTCCGCGGGGACTCGGAGACC 

GACCTGGAGGCGCTCTTCAACCGCCGTCATGAACCCCAAGACGGCCAACGTGCCCCAGAC 

GGTGCCCATGAGGCTCCGGAAGCTGCCCGACTCCTTCTTCAAGCCGCCGGAGCCCAAATC 

CCACTCCCGACAGGCCAGTACTGATGCAGGCACTGNAGGAGCCCTGACTCACAAGCATGT 

TCGAGGCTCATTCCTCTNCAGCTTTCTTGCAGTTGGGAAGC 

Sequence 43 

GTCGACCACGCGTCCGGGAGCTGCGGCGCCCTCCCTTATCGCCTTGGCAACGACCCAGCC 

GCGCCGCGAGGAGAACCGGGAATGGAGGTCGTGGCGTGAGGGGCGCCGAGCGAGGGGAGG 

CGCGGGCCACGGGAGTTCCGGGAGTTCCGGCGTTGCCCGGCAGTCCGCAGTCCTCGGCGG 

GAAGGCTGTCCCGGCGCCTCAGGCAGCTCTGCGTGGGCCGGGGTGACTTCCTCGCGATCC 

CCTGCGCGAGGTGAAGGGCAGGGACCTTTGCCGCGCCTTCCACGCGCCGTGCCCCACCGG 

CGAAGTGGGCTCCATCTTCCTTCAAGACTTGTCGCTTCGCGGACAGGGCGCCCGTGGGTT 

CTTCCCGGCCCT TCCGT ACCGTCCTCTTTCCAAGCGGGCTTCCGGAAAGCGGGTTCCTTG 

TCTTTCGGACGCATTTTCACCCCGCGCCGGGGAGGAGCTTNCCGGGNAAGGGTTCCACGG 

CGGCCGAGGGTTTTCC 

Sequence 44 

CGTCCGCGAGACTCCCGCGCCCACCACCCCCGGCGGAGCTGCTGCTGAGCCACTCAATCT 
GAGCCCTGGCTACTAATAAAGTTCGTTTAAAAATCATAATCATTCTTAAGAGAGCGAAAG 
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AGGGTGCGAACTAGCCGCTCGGCCCCGCANGGAGAGCTGGCGCGTCNGGAGGAGACAGCN 

GCGGCAGCGGTTGCGCCGCGACCAGGAGGAGCCGGTGGCGCCGGGCGGCGGGTCCGCGGC 

CGGTGGGGGACNGTGAGTAGCGGCTCGCGCTGCGGTGCAGCGGCGCTGCACTCACTCGCC 

CTCTCCAGGGGCTGGGGGTTCGTCCGGCTCCACTGGGGAAGACTCAGCTTCTCCCCGGGG 

TCCCGGGTTGTGCCTTACTCTCCGGAGTGGGCAGGGGTATCGAGGGCAGGGGCCTCCCGG 

CCCGGCTCCCCCATCCCCCGGTTCGGACCGACGAGCCGCCGGCTTCTTCCCTTCCCTGAG 

CACCGATCCCAAGTTCCA 

Sequence 45 

TCACCACGCGTCCGGGCGTCCGGGTACCCGAGGGCTCTCCCGCGTTGCTGGCACCGCTGG 

CGCCGCGGTCTCGTAGCGCATGGGCCTGCTCCGAGGCGGGCTCCCATGCGCTCGGGCCAT 

GGCGCGCCTGGGCGCTGTGCGCTCCCACTACTGCGCCCTGCTGCTGGCCGCGGCGCTGGC 

CGTCTGCGCCTTCTACTACCTCGGCTCAGGCCGGGAGACCTTCTCCAGCGCCACCAAGAG 

GCTGAAGGAGGCCCGCGCCGGGGCTCCCGCCGCGCCCTCGCCGCCCGCGCTGGAGCTAGC 

GCGGGGCTCCGTGGCGCCAGCCCCCGGCGCGAAGGCCAAGAGCTTGGAGGGCGGCGGTGC 

CGGGCCGGTGGACTACCACCTGCTGATGATGTTCACCAAGGCGGAGCACAATGCCGCGCT 

GCAGGCCAAGGCCCGCGTCGCGCTTGCGCTCA 

Sequence 46 

CCACGCGTCCGTGTCGGGCCGGCGCTCCCTTCTCTGCCNGGTGGCGAGTACACCTGCTCA 

CGTAGGCGTCATGAGGTCTCCGGTTCGAGACCTGGCCCGGAACGATGGCGAGGAGAGCAC 

GGACCGCACGCCTCTTCTACCGGGCGCCCCACGGGCCGAAGCCGCTCCAGTGTGCTGCTC 

TGCTCG7TACAAC7TAGCAATTTTGGCC I I I I I I GGTTTCTTCATTGTGTATGCA7TACG 

TGTGAATCTGAGTGTTGCGTTAGTGGATATGGTAGATTCAAATACAACTTTAGAAGATAA 

TAGAACTTCCAAGGCGTGTCCAGAGCATTCTGCTCCCATAAAAGTTCATCATAATCAAAC 

GGGTAAGAAGTACCAATGGGATGCAGAAACTCAAGGATGGATTCTCGGTTCC I 1 1 1 I I I A 

TGGCTACATCATCACACAGATTCCTGGAGGATATGTTGCCAGCAAAATAGGGGGGAAAAT 

GCTGCTAGGATTTTGGGA 

Sequence 47 

CGCGTCCGCGGACGCGTGGGCGGGGCCGCGGAGCCGGGCCGGGGCATGCGCCGTCTCCGN 

CTCGGGGCCGNCGGGGGCGCCCTGCTGAGCGCTACCCACGTGCGTCCGCGCCACCTCGCG 

GGCGACCCCGCGGCCAAGGCCCCCGGCGGAGCGGNTCCCGGGCGCCCCGAACTAGCCCCC 

AACTTTGGGCGAAGTTTGCCTGCGCCTCTCCCCGCCCCCACGCGGCGCGCCGGGGCCGCG 

GACGGNAGCGGCCCCCGGGGATGCGCCTTCCCGGGGTACCCCTGGCGCGCCCTGCGCTGC 

TGCTGNTGCTGCCGNTGCTCGCGCCGCTGATGGGAACGGGTGCGCCGGCCGAGCTGCGGG 

TCCGNGTGCGGCTGCCGGACGGCCAGGTGACCGANGAGAGCCTGCAGGCGGACAGCGACG 

CGGACAGCATCAAGNCTCGAGCTGCGCAAGCCNGACGGCACCCTCNTNTTCTTNACCGCC 

GACTTTAAGA 

Sequence 48 

GCGTCCGCTGCATTGCGCCCACCGACTCCACTATGTTGAAGAAATTCGACAAGAAGGATG 

AGGAGTCAGGTGGAGGCTCCAACCCA7TCCAGCACCTTGAGAAGAGTGCGGTACTCCAGG 

AGGCCCGTGTATTTAATGAAACTCCCATCAACCCTCGGAAATGTGCCCACATCCTCACCA 

AGATTCTTTATCTCATAAACCAGGGGGAGCACCTGGGGACCACGGAAGCGACCGAGGCCT 

TCTTTGCCATGACCAAGCTCTTTCAGTCCAATGATCCCACACTCCGTCGGATGTGCTACT 

TGACCATCAAGGAGATGTCTTGCATTGCAGAGGATGTCATCATTGTCACCAGCAGCCTAA 

CAAAAGACATGACTGGGAAAGAAGACAACTACCGGGGCCCGG 

Sequence 49 

ACGCGTCCGCAGAANGCTCTCAGATGGGACAGTCTTTTACTTTCATTCTCACCTCTGTAA 

ATAGCAGGACAGGTTGGGGTGGGCCTGACTTCTATTCTGCTTTCAGGGGGTACTTACTGG 

MAATCAACTTAGGMGTGMTTTGAGGGTTGGTGTAATTTTAAGCCCAGCCTCTGATCC 

TTGGTTGCACAAAGCCTAAT7TCCAAATATTTCTAACAGATTCAAGACTGTATTGGCAAA 

GAGGTAGAGAGCTATGATAATGACATAAATTACATAAAAATCCAGTTGAATGAATAAGAA 

GGAATTTGGGCGTATAACCCATGGAACACCAATGGTGCTAAGAATTTGCCAAACCCTCAG 
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CTTCGTATAAGTCCCAAAGAACTCAGGCTTATAGCACTAAGCAAATTACCAGTGGGAGGA 
GGAGGACCTGCCACGGAGCTGGATMTTACATTTAAATATTTTTGCCANTCTGTTGGAGA 
C 

Sequence 50 

NCGCGTCCGGCAGGCGGGTCGTAGAGAGCGTTCANGCCGTCTGTATATCTCCCCAGATAC 

CTGAAACTGACCACCTGAGTACGTTTTCCCATTGCTGAGCTGTTTCCCTGATATCTGGCC 

ATGCAACGGAGATCAAGAGGGATAAATACTGGACTTATTCTACTCCTTTCTCAAATCTTC 

CATG7TGGGATCAACAATATTCCACCTGTCACCCTAGCAACTTTGGCCCTNAACATCTGG 

TTCTTNTTGAACCCTCANAAGCCACTGTATAGCTCCTGCCTTAGTGTGGAGAAGTGTTAC 

CAGCAAAAAGACTGGCAGCGTTTACTGCTCTCTCCCCTTCACCATGCTGATGATTGGCAT 

TTGTATTTCAATATGGCATCCATGCTCTGGAAAGGAATAAATCTAGAAAGAAGACTGGGG 

AAGTAGAT 

Sequence 51 

GCGTCCGGAAGGTCCTTGAGCCATCTGGATGGCGGGCAGTCTGGCACACTAATGTGTTCA 

AGGTGCTGGTTGAGATCACAGATGTGGACTTTGCAGCCTTGAAGGCAGTGGTGAGGCTTG 

CTGAACCATACCTCTGTGACTCTCAAGTGAGCACTTTTACCATGGAGTGCATGAAGGAGC 

TCCTTGATCTGAAGGAGCATCGGTTGCCCCTGCAGGAGCTGTGGGTGGTGTTTGATGATT 

CAGGA 

Sequence 52 

GTCGACCNCGCGTCCGGAAAAAAAAAATGACCCAGAGATATTAACAACTTGACCTGGTTA 

TACAGTTAGATAT7TGCACAGTCTGGACTCAAACTGGAGGC7TCTGACTCCTCATCTAGG 

CTCCTCTCACTCTGCCATTGCATGGGTTTTCTCATATACCTTCTCTCATAAGGTTTTCAC 

AAATTTGTCACCGTCAAATAATTATCAAAATTA7TCACACTATTATAGATGAAAATAATG 

TGCTTATAAAGATTAAGTAACTTTCCTGAGGGCGCAGGTATCTGGTTCACATAACAACTA 

GCCTGGCTTAGAATAAACACATATTTCCTGGTTCTGAAG7TGGTGTCCTTCCTACCACTT 

TCTGCTGTCTCCTAAAGATAAAGAATGTTATTGGCTCACTGAATTAATCCATTCTGTTCC 

TGGCTGAAATAAAAATTGGTATATTCCTTACGTGAAGTGTCAACAGGAAGGGGGCTTTTA 

CAACTTCCTTT 

Sequence 53 

GGAGTCNCCACGCGTCCGCGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGC 

TGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTGGCGAC 

CAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAG 

TGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGGCGGACGGA 

GAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCGCCGCCTCC 

CCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGG 

GAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCA 

CAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTC 

CTNCTCTTCTCTTGGAGGGCGCTCTGGCCCATCAGACCGGATTAI 1 1 1 1 I CAA 

Sequence 54 

CNCCCGTCCGGAATNCCCATAGTTAGCTGCTGTGCTTTCACAACTTCTTTCTCTGTAAAT 

TCCTCGCTTGGCNCTGAGAAGGAAAAAAGATGTTCGTAAAGGGCTCAGCGAGGAATTTAC 

AGAGAAAGAAGTTGTGAMCCACCATAGTTAGTTGCTGTGCTTTGMTTTCTTTTGCTCA 

AATGGCCTCAGCGAAATCTTATTTGCCTATAGCATATCTACAAAAAATTTTCCTAGACCG 

TCTTTTCTACAACTGGATGGTAAAGTTGATTGAAGTGTGCCTCATGTAGCTTTATGTTTG 

GGGCATTTGAAGGGCTATGGCTGGACCAGAGTGTAATATAAATGCTTAATAGAGAGGGGA 

AAAGAAGAGTGTAAGAACCATTATAGGGCTGGGCTCACGCCTGTAATCCCAGCATTTTGG 

GAGGCTGAGGCAGGCGGATCACGAGGTCAGGAGTTCNAGACCAGCCTGACCAACATGGTG 

AAACCCCATCTCTCTAAAAATACAAAAAT 

Sequence 55 

GTCGACCNCGCGTCCGCAGCCTTCGGCTGCGGAGGGGGCTCGGCGGCGGCCGGCGGAGAA 
AGTTGCTCCGAGAAGAGGCTGGGTCGAGCTGGGCCGAGCCGGGCGCGCAGGGCGGGCGTC 
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GCGGGCGTCCCGGGCGGACGCGGCGCGGAGACTGCCGGCGCGTCCCGGGGGTTCCGATTT 

GAAGACCTTGCTTCTCATCACCCACTGGATTATGCCCCAGGCTTTCCTACCCAATGATCC 

TCTTGCAACACGCCGTGCTTCCTCCACCTAAGCAGCCCTCACCCTCGCCTCCTATGTCAG 

TGGCCACCAGGTCTACAGGAACCTTGCAGCTTCCACCACAGAAGCCTTTTGGGCAGGAGG 

CTTCCTTGCCTCTTGCAGGGGAAGAAGAGTTATCGAAGGGAGGGGAGCAAGGACTGTGCC 

CTGGAGGAGCTATGTAAGCCCCTGTACTGCAAACTCTGCAATGTCACCTTGAACTCTGCA 

CAGCAAGCCCAGGCTCATTATCANGGTAAAAATCATGGTAAGAAA 

Sequence 56 

ACCNCGCGTCCGGACCTGTTGGCGACATGGTGGCACCCGTGCTGGAGACTTCTCACGTGT 

TTTGCTGCCCAAACCGGGTGCGGGGAGTCCTGAACTGGAGCTCTGGGCCCAGAGGACTTC 

TGGCCTTTGGCACGTCCTGCTCCGTGGTGCTCTATGACCCCCTGAAAAGGGTTGTTGTTA 

CCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGATG 

GCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCACTGGGAAA 

TAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTATG 

GGGTGCATGCTGTTTACCAGAGGAGGACATCAAGATCCTGCATTATGTACACTGATCGTT 

TCTGCAGCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAAGTAATGTG 

CCTTCAAACTTTAAACTTTGGAA 

Sequence 57 

GTCGACCACGCGTCCGTCTCATTTGTGAAGAGGTCTGTCTTCCTGAGGAAGCAGGGGACC 

CTCACCTGTGAACCAAGTGTGCCATGGGAGCTGCTCCATGTCCAGGTCCAGGTCTCCTGG 

TCTGCAGGGAACGGCACAAGAGGGCTGGCCTAGGCCAGGAGGATGTGATCTGTCCTAGAA 

GGGGGCTGACCTGCTTGCTGACCCCGCTTGCTGCTGCCTGGCTGACCTGACTCAGCCACG 

GCTGTTCCGAGGGCCCTTCTGAGTACGAACTTCCAGTTGGAGGATCTGGGTGMGACCCA 

GCTGCTTGAGATAGCAGCCTCTGGCTAGGCCCTTGGCGTGGCCAAGCCAATCAGGCAGGT . 

TTAGAGCCTGGTGCCCCTAGACAGGTCCTGCAACCAAGAACAGGGGGTAGCCTTCAAAGG 

CCAGCCCTGCCTTCCAACCACCGCTCCACAGCGAGGGAAACCAAGGCTCTTAGGGCAGGA 

GGGTTGT 

Sequence 58 

CCNCGCGTCCGGGGAGCAGGGATCAACGGTGGTCCCCGTAAACCTGACAGTAAACCTGAC 

AGAGGCTGCAGGAGTGCATTTCCACCCAGGGTGCACTCAGCGAGTGGAACTCCACACCCG 

TTTCTTTGGAGTCAAGGCGCGACCTCTCAGGGAGGAGACTGCTCCTGGTTGCCCACTGCC 

GGGTCATCCCAGCTTGCAGTGGAACCCTCCGCAGCCTGGCCTCTTCCAGGGTAGCCGCTC 

ACTCCCCTCTCTCTTGTCCTAGGATAAGGCCGAGGAAGGCTGACGAGTTCCAGCTCTGGG 

GATGCCCTATCAGCTGTGTCACCTTGAACAAATCAT7TCTCCTCTTGGGTCTCTGTTTCC 

TCCAGTGTGAAACGTGGTGAAGGCATGAGGGGCTATGGGAGCCCCAAGGCCTCTTTCAGA 

GATCTCCTCTGGGTCCCATGTGACCCCGTGGCTATCCCCAAAGGCAAGAGGGTCCCCAGC 

CCTGCACCAAGGCCCTGGG 

Sequence 59 

CCGAGGAAAGGAGTTGGTTCGCGCAGGTGCGGCGCCTGGGTCCCCATGGCGCTGTGGCGC 

GGCTCCGCGTACGCGGGCTTCCTGGCGCTGGCCGTGGGCTGCGTCTTCCTGCTGGAGCCA 

GAGCTGCCAGGCTCGGCGCTGCGCTCTCTCTGGAGCTCGCTGTGTCTGGGGCCCGCGCCT 

GCGCCCCCGGGACCCGTCTCCCCCGAGGGCCGGTTGGCGGCAGCCTGGGACGCGCTTATC 

GTGCGGCCAGTCCGGCGCTGGCGCCGCGTGGCAGTGGGGTGAGTGCCAACGGGGCCTGGG 

TCTNTGAGCCTCCGAGGTCGGCCTTGGAGGTCGGGCGGAGCCGCGCAGAAACAGGGCTTC 

TCAGAGGNCCCCGGGAGGCGCTTGCTGTCCGCGCTGGCCCC 

Sequence 60 

CGCGTCCGGTGGGAAGCCAGAAGATAAAACCAAATGGCTGGGCACGTCTTTAGGTTATTC 

CTAGCTMGAGTTAAGAGTTGTAAGCTCTCTCATTCTTTGTTCTTCAGCCTTAAACTATC 

TTTCCTTCTATTMCTTTATTTGTCTCAGTTACAATGATAGAGGTAACTTCACATACTAA 

AAGAAATTAGGTTACCATGTGAAACATTCTTCTTGGCTTGTGCTAATGTTATCAGATCCA 

MCAGCATCTGAAAGAAAATTTTCCAAGTACGATGTTGTTCTCTTGTTTTCTGAAATACA 
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TATCATATGTTMAGTGAGAGTTTTTATACATGTTGAAAGAAGTTGAATGACATAACAAA 

TAGTTACTGAGGCCTCCATTTTCTTACTTCACAGTTAAAATTCCTGTTTCTCTTTGGGTA 

TAGGAGGGTAGAAAGAAGTGGGAGAGTAATAGCATTTTAAAACACAGAATCAAAAATCAT 

ATTAAAAGTAG 

Sequence 61 

CGTCCGGAGCATCGCGCACTGCGGCCGGGTCACCGACGTGGGCATCCGCTACGTGGCCAA 

GTACTGCAGCAATGCTGCGCTACCTCAACGCGAGGGGCTGCGAGGGCATCACGGACCACG 

GTGTGGAGTACCTCGCCAAGAACTGCACCAAACTCAAATCCCTGGATATCGGCAAATGCC 

CTTTGGTATCCGACACGGGCCTGGAGTGCCTGGCCCTGAACTGCTTCAACCTCAAGCGGC 

TCAGCCTCAAGTCCTGCGAGAGCATCACCGGCCAGGGCTTGCAGATCGTGGCCGCCAACT 

GCTTTGACCTCCAGACGCTGAATGTCCAGGACTGCGAGGTCTCCGTGGAGGCCCTGCGCT 

TTGTCAAACGCCACTGCAAGCGCTGCGTCATCGAGCACACCAACCCGGCTTTCTTCTGAA 

GGGACAGAGTTCATCCGGCGTTGTATTCACACAAACCTGAACAAAGCAA AI I t I I I I AAA 

AGCAGCGTATGTAMGCACCGACACCCACTCAAAACAAGCTCTTTCTTTCNGGAAGGGTA 

TTAAGGAAT 

Sequence 62 

NCCACGCGTCCGCCAGNCTGTGAAGGATCCCAGACTGGCATATGCAGGAGGAAATGGGGC 

GGGCGAGGAGTAAGGACCCCAAAAAGCAGGGGTAGGGAAGGGCCCTCCCAGCGCCCCACT 

GTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGAAAGGTG 

AAAAGAGGAACAAAGAACATTCTCCTGGACGCCACCCACAGAAAGCCACGTGCAGGCTTG 

GCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTCAGAGCT 

GGGGCAGCGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTAGTTACCGGCAC 

ACGGGCACCAGCGCCCCCTCTCAGCAAACTTNCACGTCTTATGAAATTAGCACTGGATTT 

CCACTTCAATTGGA 

Sequence 63 

CCCACTGTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGA 

AAGGTGAAAAGAGGAACAAAGAACATTNTCCTGGACGCCACCCACAGAAAGCCACNTGCA 

GGCTTGGCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTC 

AGAGCTGGGGCAGCTGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTATTTA 

CCGGCACACGGGCACCAGCGCCCCCTNTCAGCAAACTCCACGTNTTATGAAATTAGCACT 

GGATTTCCACTTCAATTGGA 

Sequence 64 

NCGCGTCCGCTTCATCTTAGGATAAAGTCTAAATCTTTG I Mil I GCTATTGTACTAAAC 

TCATAMTCCTAGGTTATAMGATMAGCCTTAAACTTTATCTCATCATCCAGCCCAATT 

TCCAGCCACMTGMGTACTTMMCTCTGTGTCTTTGTACTTGCTGTTCTCTTGGCCTC 

CMTTCCTTTTCATCTTTTCCATTCGGTAAAGTTTGTTTATCCCACAGGCCCTAT^ 

AAGCCTCCAGCAACTTCTCCAGACAGAGGTGTTAGCAGTGTAGGATCAGATTTCTCAACC 

ACGTCACTCCCATGTCTGGGTAGATATCTCTGCCCAAGTGTTCTCATAGCACTTGAGCAG 

TACTCTCTAAGCGCCCAGGATCTACCATGTTGC I I I I I I AAAATTTGATTAATTTATTTT 

TTTATACTGCTCCTTGTGGAGCANGGAGTGTTCCCAGAGTAGCCCACCATGTTATATTGA 

ATGGATCTGTGTGCATMTGCAGCTGTCCATCTACATCGTATATTTTTGTCTCCTCAAGG 

GTAGGGA 

Sequence 65 

GTTTGTATTCTGGATACAGGGGATACTGGGGCTCGCTATGTGTGTGGAGCCATCCCTTCC 

TTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTCTGAGCTTCCTTTCCTG 

CTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTGCTACTTCGGGTCAGGT 

GGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACTTCAAGAGGAGAGGCTG 

CAGGGAGCCATCCTAMCCCAGAGGCCTGGAGCTTACCGTGTCACTTTACTTTTGTACAC 

AGGGGTCTCCTTAGTGCCCTCGAGAAGGATTCTTGGCCCTGAGCTTCTACTCCTGAGGCC 

ACCTCTGTGCAGCCCCAGCTCCCTCAACTCTAGGCTGTAGTCTCAGTGGGAAAGCCTGGC 

TTGGGGGTCTCCTAGGAATGTCCACCTGAAGGCACACTTGATAGGGGCTTGCACAACTTA 
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NOT 

Sequence 66 

CGTTCGGCAGCAGCAGCTGGCGGCCAACCGGCACAGCCTGGTGGAGAAGCTGGGGGAGCT 

GGTGGCGGGTGCCCACGCACTGGGTGAGGGCCAGTTCCCCTTCCCCACTGCTCTGTCGGC 

CACCCCCACGGAGGAGACGCCCACCCCACCCCCAGCCGNCACCAGCGACCCCCCGGCCGA 

AGACATGCTGGTGGCCATCCGGCGTGGGGTCCGGCTCCGCAGGACCGTCACCAACGACAG 

GTCGGCGCCCCGCATCTTATGATGGCGCCACCCTCCCCATCCTCTCAGGCCCCAGTGCGA 

GCAGGTGGCCTGGTCTGTGAGCCGCAGGCACTCAGAGCAAAGGCCCAGCCAGGAGAGAGG 

ACAGAGCCAGGGCAGAGGCCATGCCACTTTATGGAAAGACACCTCACTTGGATTCCAGCA 

TTTAAACAGGAAGTGACTTCTTAGCAAGCCTGGCCAGGACGGAGCCTGCAGGCCTGGGCC 

TGGTTCGGGGTCTGTTTTATGCTCTTCGGTCCCTTCTCTTCCTTCTCTGGGGCCGTGCCT 

CTTCCTACCCATAAAGCACCAAAACCAGGGCCGCTGCCATGACAGAGGGGCCAGGCTGGC 

CTTCCTTTCACATCCCGGCCTNTTCCAAGGCTGGTCCTGCCTNACTTCTTCTGGAATGTG 

GGCCCCCTCTCCCTTGCTGACCCCTCTTCCTNTTGCTTWIsrrrrGCTTTTCCANGCCCT 

Sequence 67 

GGCGT CCGGCAACCTAAAAATAGGATGCACCAATAGCATGTGGTTCCAAGTAAGTTGTGA 

TTTTTATTTTGAGACAGTGTTCCACTGGAAGGGAGGGAAGGGCTTACATTCACAGACAGT 

AMGCAGGGCTGTGTAAGGAGCTACATTTACTTAACAGTCTCCTTTCCAGCTAGGTTTGT 

TTTATTTATTTGCAAGTCAGCTAAGAATTAACCTTTTAAACCATCTAAAACAGGCAAGCA 

ATATAAAGATTTCTACTAGTGCAAGGTAAGTGGTTTGAATATACAAGTGCCCTTTCCTGC 

CACCCAGTCTCACTACCGTTTTAGTCCTGCAGCTGGGTAAAGCCACTATTGTGTGGAAAC 

TCTCCCATGTGCCCTGTCTCTACCTCTGGACACACCAGCTCCTTCTTCCACCTATTCTAT 

TCCTCAGTTAAGCCAAGTGATCAGAAGTAGTATTAAATGGGTAGATAATTTTA 

Sequence 68 

GCCACGCGTCCGGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCGGGCTCCGCGCGGAC 

CGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGA 

ATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTTCTTCCTCACT 

TCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGGAAAGGACTGGGGAAAATA 

GCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTGTCT 

GATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCGCAGG 

ACAGACGGCCGATCCCGCCGCCTCCGTACCAGCACTCCCAGGGAGAGTCAGCCTCGCTCC 

CCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCACTGTGATTCTCAATTCCTGC 

NTGGG I I I I I I I 

Sequence 69 

ACCCANACCTGGGAGGAATTAATGGAATGC7TGNCCCTGGGCAGCCTTAGAAACAGACCC 

NAGCTTATCTAANGCTGCTCCGAGGCAGTGACCCAACTANGGCTCAGGAAGTCAAGAANA 

TTGACCAAGCTTATAGTGATCACCTCTTGACCTTTGTGTCACAGTCNTTTTGCI I I I IAA 

AACCCTTTTGTGAACCGNTTATGGCCTTTGATTCTGACAGGCATCNTAGTTGTGAAGGGG 

AACANGGGCAGGATATAATGTTTCGTTTACCAAATACAANAAAATCNGANGTACCCAGNT 

AGATCACAANATTTTTTGGGAGAAGGNCTNTTGGGTCTCTTCCAGGAGNTCACTTCANNN 

TTGGNAACTTGACAGGGGCTTGGGGAATTTANTATTCCCCTTGGGCGCAGGGNCNCAAAN 

GGGTGGCANTTTCCCTCCTTGGAGN I I I I I I I I I CAAGAANTCCTTGCNTNGGGGAAAGA 

TGGTTACNANNATCCCGGMTTTCCAACCCCCTTCCCTATTTTTTTGGTTTAAGG 

Sequence 70 

CGCGTCCGGCACATTMAMAMATACTTATTTTTTATTATGGAAAGGTCTTGGAACATT 

CTGATAGTGAGCTTCCGGCATTCATTTGCTGTATCTGGCTTAGGAGATGCTAGGGTGGCA 

AGAAGAGGCACAGGCTTAGATGCGCTGGGTGGAGAGTTGGCTTTAGTAATGATGGTTGAC 

TCTAACGACTTGATTATCAGCTGTGCC I I I 1 1 I CTTCCTGCCTTCTGAGGTGTGTTGCCT . 

GCATCCTAATTCACGTACTGAGTAGCAAGCTAAGCAGGTTGTAGCTGGAGATTGTAAGM 

ATCCTGAATGGAAACCAAAGAAAGACTGTCACATCACATGATGTGCCCTTTTCAATCCCA 
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TGTCTCTTCCCAGTGGCATCCCAGTGCTGTCTCTGCCCCCTGCTGCTTCTGTAAAGATTT 
TCTGACACMGTMCTGCCTCATAGACCTTCCTTTTTATGAAATCCTGAGTTTTGGTTTG 
GGTACGTCCTTTTTAGAT 
Sequence 71 

GCGCGGCTGTCGCGAGGGCGGGGGTCGGGGCTGCAGGCGGGGCAGGGCTGGGTGGGGGCG 
CGCGACGCACCTGCCTGCTTCCTGCACGGGTGGNCCCCAAGGACTGCGGGGCCCCAGCCC 
AAAGCGGACCTTGA 
Sequence 72 

CGCCCCGCGTCCGGGCGGCTGGTGGGCGACCGGGCGCATCCTCATTGCATGTGCGGCGGC 

CCTACCTCGGCCCTGGCCTGACCCCGGCGGCCCTGCCCGCCCCTCCCTCCAGCATCATGG 

CCAGCCCAAGAACCAGGAAGGTTCTTAAAGAAGTCAGGGTGCAGGATGAGAACAACGTTT 

GTTTTGAGTGTGGCGCGTTCAATCCTCAGTGGGTCAGTGTGACCTACGGCATCTGGATCT 

GCCTGGAGTGCTCGGGGAGACACCGCGGGCTTGGGGTTCACCTCAGCTTTGTGCGCTCTG 

TTACTATGGACAAGTGGAAGGACATTGAGCTTGAGAAGATGAAAGCTGGTGGGAATGCTA 

AGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTACCGATCCTTGCTGGTCCTTTGCAGGG 

AGAAGTA 

Sequence 73 

GCCCCGCNTCCGGAAATGTCCGA I 1 1 I I I 1 1 I AATTTAATGAAATTGTTAATGAGGAAAA 
ATTTTTAATATAGGTCTTATCTACCACACATCCCCATAGATTTAAGGATTTTAATAGAAA 
GTCATGATGTATGTATTTAAGCCACGTTAAAAGAAAAAATATAACTATGGACCGGTATTC 
AGTGAATACAGTTTCATGG I I 1 1 I AATTCTTTCAAAGCACATTAAAAATGGTGTGCTGAT 
MACCCCAAGTAAATTMCCCTTTTTCCGTATAAATCCATTTTTTGTTTTGAAGAGGGGA 
: MTTATATTTATTGNTGTTTACTGAATCCTGGTGTGAAAGCATATCAGATATGTATGAAC 
TGCTACTGCTGTACTTCCGATTTACGGACATCATTTTATTGCTATTTGTAGACCGTGATA- 
ACATGAACATGAGTCCTATTTATGTGGGCCTTCAGTGGATGGGCAGTGCCACTCANGTCT 
CTGGGGGTTTCCTCTCTTAATTTTAAAGTAA 
Sequence 74 

AGTCGCCNCGCGTCCGTGTGTTTCTCCTCGGTCCCCAACTCTACCTTCCCCAACCACAGT 

TCCTGTCCCCAGATGTCCTGATGCCCACCATGGCAGGGGAGCCCAATAGACTCCCAGGAA 

CTTCAAGGAGTGTCCAGCAGTTTCTGGCTATGTGTGACAGGGGTGAAACTTCCCAAGGGG 

CCAAGTACACAGGAAGGACTTTGAACTACCAGAGCCTCCCCCATCGCTCCAGAACAGACA 

ACTCCTGGGCACCCTGGTCAGAGACCAACCAGCATATTGGGACCAGATTCCTGACTACTC 

CAGGGTGCAATCCTCAACTAACCTACACTGCCACACTACCAGAAAGAAGCAAGGGCCTTC 

AGGTTCCTCACACTCAGTCCTGGAGTGGATC I I I 1 1 CATTCACCCTNCCACCCTNCCATT 

GNTCATCCTGTGTACCCACCATCTAAGCAGTCTTCATGTACCCCTGAGGTCAAGCTTGGA 

A 

Sequence 75 

CCCGCGTCCGGGCTGGCATGGCTCTATATAAGATTGTTGCANAAANTCCCTACTACTTTT 
GGTCTGTGATGAGCTTAATTATGCAATCTATATNGGCACAGGATGAAAACCTCTCAAAAA 
CAATGTTTCTGCCCCTTGCTGAGAGAATGGTCGAAAAAATGGTGAAAGAGGACAAGATAG 

• AAGCTGAGGCTGAAGTTGAACTTTATTATATGATCCTGGAACGTTTGGGAAAGTACCAGG 
AGGCCTTGGATGTCATCAGAGGGAAATTAGGAGAGAAGTTGACAAGTGAGATTCACAGTC 

. GGG AAAATAAATGCATGGCTATNTACAAN AAGCTGAGCAGGTGGCCAGAGTGCAATGCCC 
TTTNCCGGCGCCTCTTACT 
Sequence 76 

GNTGGAGGGAGCTTTGCTACTCTGCTCTTGGCATGACTCCAGGAI 1 1 I 1 1 1 CTGGAATCC 
AACCTCTGTCCTCTTAGGAGAAGGAACCTGTCCTTGGTTCAGATGGGTGGGCATGAGGAG 
GAAAATTTCCATTAGTGTAGAAAAGTGCTGGACAGAATCCGGTTTGGAAAATTACAAATC 
• CAGTTGGTCAAAATAGGCCATTTCCTATGTGTGACCTATTCGTGGTATGCCAACTGGACT 
GCTTCCTAAACAGGACGAGGAAAGTGAGGAATATTTTTATATGAAAGCCTTAGCCTGTCT 
GGCACCCATGAAAAAMCTATTTATGCACTCCTACTTTCACCCGTCI 1 1 1 1 GCATTCTCT 
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ATTTGTAGCACAACAGAGTTGAATGCCACAAAACACCCCGTTTATAGTGAGCTGTTTTCA 
GTGACCAATATCAGAAGGAGGCTTGCTTCTGGACTAGCCTACTAATTGCCAGCAGCCACC 
Al I I I ICATG 
Sequence 77 

GGAGAGTGCCTTGCCGGACCCTCAGGACGGAGCTGCTGGGCTGCTACAGTGACCAGGACT 

TTCTGGCCAAGCTGCACTGTGTGCGGCAGGCCTTCGAGGGGCTTCTGGAAGACMGAGTA 

ACCAGCTTTTCTTCGGGAAAGTGGGCCGACAGATGGTGACAGGCCTGATGACCAAGGCTG 

AGAAGAGCCCCAAAGGCTTCCTGGAGAGCTACGAGGAGATGCTGAGCTATGCCCTGCGGC 

CCGAGACCTGGGCCACAACACGGCTGGAGCTGGAGGGCCGAGGGGTGGTATGCATGAGCT 

TCTTCGACATCGTGCTGGACTTCATCCTCATGGACGCCTTCGAGGACCTGGAGAACCCTC 

CGGCCTCGGTGCTTGCCGTCCTGCGGAACCGCTGGCTGTCANACAGCTTCAAGGAGACGG 

CCTTGGCCACTGCTTGCTGGTCGGTCCTGAAAG 

Sequence 78 

CACGCGTCCGGAGAACGTGATTTCTCAGCCGAATGAGTTTGAACATACCCCACAGGAAGA 
■ TGACTTGGGGTTCAAGGAAGAAGATTTGGCTCCAGATCATGAAGTAGGAAATGCCTCTCT 
CAAACCTGAAGGCATCCAGAACTGGGATGACTTATGGGTCCAGAGAGAGGGTCTAGGAAA 
GCCTCAGCCTCGGGACAGAGGCCCCCGGCTCCTGGGTGAACCACGCTGGGGCCAGGCTAG 
TAGTGATCGGGCCGCTGTGTGTGGTGAGTGTGGCAAAAGCTTCAGGCAGATGTCAGATCT 
GGTGAAACACCAGCGGACCCACACAGGGGAGAAACCCTACAAGTGTGGGGTCTGTGGCAA 
GGGCTTTGGGGATAGCTCTGC 
Sequence 79 

CGCGTCCGCAAGMGATMCCCCAMCTCTTTTCTCAGAGAGTTTGTAGCCTAGTTTGGG 

ATAGATAAGATCCACATATTTAGTCATATAAGACTACAGGAGAGTAGAATAGATGCACCA 

GATGGTGTCGAATGAAAGTGGTACTTTGTAGACTATAAGTGCTGTAAATTCTAAAGGACA 

GGTTACTTTTGCCTGGAGTGGTCAAGAAAGATTTTATTTAAAATAAGGATTTGACGGGCA 

GACTTAGCAGTCAAAAGGGAGAAAGCGGGTAAAACAAATGTAAGCCATCATAAGAGTGCA 

TGTGGTTTGGAAGCATCAGGGAAAAGACTAGCCAACCTGAAGTAAAAGGTTCGTGCAAAT 

TGGGCAATCAATAGCATTAAGTTGGAAACAGCTTGGGGACAGACACATAAGAGGGCCAGA 

GTGTGMTAATTTCATCTAATACTTTATAGCACTTGGACATTTACAGAGCACTTTTCTC 

Sequence 80 

TNCTCTCTGCCCCCCCACATTCGTCCTCTTGATTCTCTGCTTCTCTAGCTCAGCCGCTGA 

GCTTCGTGCCTAGCCGCCACTAGTCCTTGACCAGCGTTCTGGCAACTCTTGCCTCCAAGT 

TCTTCCAGCTCCAGGCTGAGCCGATGGGGATTCAGTTTTCTGACATCACAGCTNAGTTCT 

TGATTTCTGCAGCAAAACCTTCAAGGCTTCCATCACCTCGGCTCTAGAGTCCATCATGCT 

CCTCTCCTATGACCTGCTAGGGCTGATCAAACGTTCTGTCTCCTGCCGTGCCCTGCCCTG 

CCCTGCCCTGAGCTTCGCTTAGCCTGTTGCAGGCTTTGTGTTTTCCTTCTTGTGCTGTTG 

GACCACGCAGCTCCTTCTACCCATAAAACCCCCTTCTCTAGGTCGGTGGAAATCTTGGTC 

ATCCTTCCGTGTCTAGNTAAACTGGNCACCTCCTCCACGAAAGCCTTCTAAAAACTCCTT 

CTCAGGGGAACTGGTTTTCTTTCT 

Sequence 81 

CACGCGTCCGCAAAATAGCCCCACATCCNGGCAAAAGGGGCCTTTCCCTTGGCCCAGAAG 

AAAAAGGAACAAGTGGAGTGCAGAAGAAAATCTGTACTGAGAGACTTGGGCCTAGCTTGT 

CTTCCAGTGAGCCAACCAAGGCTGGTGCTGTCCCATCCAGTCCCTCGACGCCAGCACCAC 

CCAGCGCCAAACTTGCCGAGGACTCAGCTCTGCAGGGTGTGCCCTCTCTGGTGGCAGGTG 

GAAGTCCACAGACTCTTCAGCCGGTATCCAGCAGTCACGTGGCTAAAGCTCCCAGTCTGA 

CCTTCGCTTCCCCCGCCAGTCCTGTCTGCGCATCAGACAGCACTCTCCATGGGTTAGAGA 

GCAACTCTCCCCTTTCACCACTGTCCGCTAATTATAGCTCACCTTTATGGGCTGCAGAGC 

ACCTCTGCCGCAGCCCAGATATCTTTTCAGAGCAGCGGCAGAGCAAACATAGGCGCTTTC 

AGAATACCCTAGTAGTCCTACATAAAATCTGGGTTGCTGGAGATCACTTTTGAAAACCAA 

G 

Sequence 82 
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ACGCGTCCGCACCCCTGTGTCCAATGACATGTGCACCCAGGTCGCCAAGCGGCCTGTGGA 

CACCCAGGCCTGTAACCAGCAGCTGTGTGTGGAGTGGGCCTTCTCCAGCTGGGGCCAGTG 

CAATGGGCCTTGCATCGGGCCTCACCTAGCTGTGCAACACAGACAAGTCTTCTGCCAGAC 

ACGGGATGGCATCACCTTACCATCAGAGCAGTGCAGTGCTCTTCCGAGGCCTGTGAGCAC 

CCAGAACTGCTGGTCAGAGGCCTGCAGTGTACACTGGAGAGTCAGCCTGTGGACCCTGTG 

CACAGCTACCTGTGGCAACTACGGCTTCCAGTCCCGGCGTGTGGAGTGTGTGCATGCCCG 

CACCAACAAGGCAGTGCCTGAGCACCTGTGCTCCTGGGGGCCCCGGCCT 

Sequence 83 

CCCCGCGTCCGCTCTTACGCATTACTCTATGTCTACTGTTATGGGTGTGTAATTTTATAC 

CATAGATGTTTACTCTTTAAACAGACACTTCTAGTCTGTTTTATTTCATGTGTCTGGGAG 

CGGATAAAGTGTGAGGTTCAGGGAGAAAGAGAGGTCTGTCTCAATGCCTTGGCACGGCAT 

GAAGACAATCTCCCCTCCTTGTCCCCTTTCCCTGCTAGCTCCTGATGACTGACAGATTCA 

CAGCAGAACAGAAAGGACTGGGAAGGGATGGAGGTGGGACATCTGGCACTGACCTTCAGG 

GGCTGACCCTGTGGGGGAACATCTGCCCTGAAGAGTTGGAGCCTTCATGTGATGACACAG 

AGCTGMGTGTGATATTCGGGAGGGGATAGAGAGTGCTTGGAGGTTTTCTGATTTTGAAG 

AATCCCAAGTCAGTC 

Sequence 84 

GTCCGGCCGCTTCCGGTCTCCCTCCCGGGCCGGCGCTGGCCTGACTGCGGCCCCGGTCCG 

TAGCACTCCGCCCTCCGCTTCTCCCGCCCTGTAGCCGCGAAGACTGCTTCAGCCTTTCCC 

TGTGCTGCCCCTGCCGCGCGATGGAGACGAGCTCGAGCTGCGAGAGTCTTGGCTCCCAGC 

CGGCGGCGGCTCGGCCGCCCAGCGTGGACTCCTTGTCCAGTTAATGTGTTAAGAGCCATT 

GACATTTGAAGATCATCAGAAGTGAAGATAAAACATCTCAAAAATTATAATTGCCTCCAC 

TTCTCATTCAGAGAATTCAGTGCATACAAAATCAGCTTCTGTTGTATCATCAGATTCCAT 

TTCMCTTCTGCCGACMCTTTTCTCCTGATTTGAGGCCCATGCAGTCCAGTTCGGGAGC 

TAAGT 

Sequence 85 

CCGCGTCCGCGTGAGGTGTGGGTGTTCGTTTCTCAGGTAAAACATGGCTAAAAGCTTACG 

GAGTAAGGTGGAAAAGAAAGATGCGTGCTGAAAAGAGAAAAAAGAATGCCCCAAAGGAGG 

CCAGCAGGCTTAAAAGTATTCTCAMCTAGACGGTGATGTTTTAATGAAAGATGTTCAAG 

AGATAGCAACTGTGGTGGTACCCAAACCCAAACATTGCCAAGAGAAAATGCAATGTGAGG 

TAAAAGATGAAAAAGATGACATGAAAATGGAGACTGATATTAAGAGAACAAMAGACTCT 

TNTAGACCAGCATGGACNGTCCCMTTTGGNTGAACCCAAAGGCAAANAAAAANGNTTTG 

ANGGCAAACCGANNGAAAAAAAAANGGGGAAAACCAACCNAAANCCCTTAAAANGGGCCA 

ANGGGGTTTGGCCNCTGNNMTNNTTTNAACCCNTTTGAAAACCCCCCCTGGNNGANACC 

NCCCGTAAAAAATNTTCCCCCCNN I I II I I I I I I 

Sequence 86 

CCACCGCCGTCCGAGGAGGGATCACCAAGCCGTGGGCCATGAAAGTCGGGGGGGGGCACC 
GCAAGCTTGAMGCTTCATCATTGACCTTTCNCAAGAAATTACCGGGGCCAAGGCCGCTT 
GTTTCNAAAACCCCCATGGAAACNAAAACGANGGGGAGGCANGGGAGAACCACCCCCTTG 
GACI I I I I I I INATTTCTTCTTGGACCTACCGAAGGAGGGCACAAATTGCCGGGAAGATT 
GCTTGCCTTTCCACCTTGGGACAAGGGGATCCTGGGACTTTTCCGGCCCGGGGTTCCCNT 
TCCCGTTGGCCCGGCAAGGGATTGGGTCNAAACAATTGACCCAAGGGAAGAATCCCCGGG 
GACGGTCACMCCGGGGNACCAAAGGNAAAGCTTTTTTGGGANGGGAACCC 1 I I 1 I I I I A 
ATT 

Sequence 87 

CCGGGTCCCTCCCTGCGGAGCCGCTGGTCCGGCTGGCGGAGATGTGACCGCGGGCCCGGC 

CGGCCTGCCTCAGGCGTCGCGTCAGCTCCCGTGTCCGTGCCCTTAACCCACACCGATGGC 

GGGATCCGGCTGCGCCTGGGGCGCGGAGCCGCCGCGTTTTCTGGAGGCCTTCGGGCGGCT 

GTGGCAGGTACAGAACCCGCTGGGTAGCGGCTCCTTCCNCTTCGGGGTATTCGGGTTTCG 

CTTGTrGGGGMCCCTTGGTTGGCCCCCGGGGCCCrrAMNCAGTTNTTGCCGCCAGAAA 

CCACCGGGGTTGCGGCCTTTGCCGCCNAGTATGNTTTCCGAAAAAAAAGGGCCGGGCTTG 
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NACAAGTTGCANGGGTCACAAAAACATCGTGAATTTTGATNGGAGTGGTTACAATCCAC 
Sequence 88 

CGTCCGTTTAATTATAACCTAGATTGTCTGGGCAACGGCAGGAACGGAGTGCCACTGTGG 

AGCAGATAACTGCAGTGGTTTTCTAGGAGTGCGGCCAAAGTCGGCATGTGCGTCAACAAA 

TGAAGAGAAGGCAAAAAATGCTAAGTTAAAACAGAAGAGACGAAAGATCAAAACAGAACC 

AAAGCAGATGCATGAAGATTACTGTTTTCAATGTGGAGATGGTGGAGAGCTGGTCATGTG 

TGACAAAAAAGACTGTCCCAAAGCATACCACCTCCTATGCCTTAACCTGACTCAGCCACC 

ATATGGAAAGTGGGAGTGTCCGTGGCATCAGTGCGATGAGTGCAGCAGTGCAGCTGTTTC 

CTTCTGTGAATTCTGTCCACATTCATTTTGTAAAGATCATGAAAAGGGGGCCCTGGTTCC 

CTCTGCACTGGAAGGCCCGCCTCTGCTGCTCGGAACATGACCCCATGGCTCCTGTGTCAC 

CAGAATACTGGAGCAAGATAAAATGTAA 

Sequence 89 

NGTCGCCCCGCGTCCGTAAAATGTTAAGTCCCCTAAAAGTGAATAAATTTTAAATACCTA 

CTTTTAMATVTACTGTCTTCTAMTTGACATMTTGCTTTTCTTACCAAAAGAAGGAGAG 

GTTCCCCTAATTCCTTTTGGGCCATAGATCCGCTTTTAGGATCTGATTAAAGATGTGGAC 

TTTCACCGCAGGAAGATACCCATGGGCACACCATTCAAAATGTATCAGATTTCAACGGTT 

TTACAAACTCCCCCGGGATTAAGTCAAATGGGTGGATATTACAGCTGTTTTCATCAGATA 

TGGTTATTTGTTGACAACCATCAGGACAACAATGTTTATAGATGGAAGGATAACTTCCTG 

GTTTTTTTCCACATTGAATGTGGCTAGTTACATATCTCAATTTAAAATAAATTGTGGAAA 

GCCAAAAAAGTATGGTCAAGCTAACCTTGGGGTGGCTTTTACCTGATGCCTACAAGCACA 

GAAAAATAG I I I I IAA 

Sequence 90 

CCGCGTCCGATTATGCCAAGAGAAGGTATTACTTTAAGATGTGAAAAATGTAAAATGGAA 

AATTACATTACTAAGAAAAACAAAAAAACACAAACTGNNAAATTAGAANTGAAAAAACAT 

TGCCACAAATGCAACGCACATACAAGTCATAAAGAAAAAAAATAATTTAAAATGAATAGA 

AAATATTTAAGACTTAAGCTTTGAAGAACACAATTTAGATGCAATAATTTCATTTTCTCA 

ACAAACAAGATTATGAGTTTCTAATCTTCAAACAAGTGATGGAATAGTGTTTCTTGAAAA 

AGAAGAAATCCATTTATTTGTTGATTCAAGATATATAGAAGCTGCTCAAAAAGATGCAAA 

AAATGTACAAGTTCATTTATTGACAGCAGCTMTTTAAAAGATTTTGTAAGTAGTAAAAA 

TTACTTAAAAATTGGTGTTGAAAAAGMTACTTAACTTTAGCTGATTTTAAAAAACTTCA 

AGCTTGATTTCCAAGTGCAGAATTTGTCAAATCAATGCGCAAAAATTAAGACTTATTA 

Sequence 91 

CCGTTGTCCCATATATCTTGTTCCAGCAGCCATATATCTTGNGGTCTACACGCCTAAAGC 

ATGATTTCCCTTGAAGTCTTGGGGTTGNTTAAAGGAGAGTCCCTTCAATATAAAACCTCT 

GAAATATTAGTGAGMTGGCTCACTMTGTGMCMTGTTTAAATTATTTATTTATATAT 

AGAATTACTGAATATTAGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT 

CTTACATATAAGAAMATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 

TTGAGCATGAATGTATGAGCCTAATTAGAGAAACAGAAAATCAGCATGTCAGTTTTCCTT 

CATTCAAAATAACATAGTCTTTCTAAGCAGTCATTCTGGGAG 

Sequence 92 

ACCACGCGTCCGCAAGGCCCGCCCCTTACGTACTCGGAGCTCGGATCCCAGTGTGGACCT 

GGACTCGAATCCCGTTGCCGACTCGCGCTCTCGGCTTCTGCTCCGGGGCTTCTTCCCTGC 

CCGCCCGGGGCCCTGACCGTGGCTTCTTCCCCGGCCTGATCTGCGCAGCCCGGCGGGCGC 

CCAGAAGGAGCAGGCGGCGCGGGGGCGCGCTGGGCGGGGGAGGCGTGGCCGGAGCTGCGG 

CGGCAAGCGGGCTGGGACTGCTCGGCCGCCTCCTGCCCGGCGAGCAGCTCAGACCATGTC 

GCCTGAAGAATGGACGTATCTAGTGGTTCTTCTTATCTCCATCCCCATCGGCTTCCTCTT 

TAAGAMGCCGGTCCTGGGCTGAAGAGATGGGGAGCAGCCCGCTGTGGGCCTGGGGCTCA 

CCCTGTTCACCTGTGGCCCCCACACTTTGCATTCTCTGGTCACCATCCTCGGGACCTGGG 

Sequence 93 

NCGCGTCCGCCAAGATGGCGTCCNTCATGGAAGGGCCGCTGAGCAAATGGACTAACGTGA 
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TGAAGGGCTGGCAGTACCGTTGGTTCGTGCTGGACTACAATGCAGGACTGCTCTCCTACT 

ACACGTCCAAGGACAAAATGATGAGAGGCTCTCGCAGAGGATGTGTTAGACTCAGAGGAG 

CTGTGATTGGTATAGACGATGAGGACGACAGCACCTTCACAATAACTGTTGATCAGAAAA 

CCTTCCATTTCCAGGCCCGTGATGCTGNTGAGCGAGAGAAGNGGA 

Sequence 94 

ACGCGTCCGCGGACGCGTGGGTGCGGGCCGGCCNCCCTGGACGAAAGAAGAGGGCCCCTC 

CAGGCCAGTCTGGGCACCCTGGGATAGCGGCTGCAGCCAGGCATGGCCGACTCTGCACAG 

GCCCAGAAGCTGGTGTACCTGGTCACAGGGGGCTGTGGCTTCCTGGGAGAGCACGTGGTG 

CGAATGCTGCTGCAGCGGGAGCCCCGGCTCGGGGAGCTGCGGGTCTTTGACCAACACCTG 

GGTCCCTGGCTGGAGGAGCTGMGACAGGGCCTGTGAGGGTGACTGCCATCCAGGGGGAC 

GTGACCCAGGCCCATGAGGTGGCAGCAGCTGTGGCCGGAGCC 

Sequence 95 

CCCCGCGTCCGAGGTGACCTCCTTGGCCCAGATCATCTTAGAGCCAAGAAGCAGGACCAT 

TCGTGGTTTTGAGGCCCTGATTGAAAGAGAGTGGCTGCAGGCTGGTCACCCATTCCAGCA 

GCGCTGTGCACAGTCAGCCTACTGTAACACCAAGCAGAAGTGGGAGGCTCCTGTATTTCT 

TCTCTTCTTGGACTGCGTGTGGCAGATCCTTCGTCAGTTTCCCTGTTCTTTTGAGTTTAA 

TGAGAATTTCCTCATCATGCTCTTTGAGCATGCTTATGCCTCACAGTTTGGAACATTTCT 

GGGCAACAATGAAAGTGAAAGATGTAAGTTGAAGCTACAGCAGAAGACGATGTCTTTGTG 

GTCCTGGGTTAATCAGCCCAGTGAGCTGAGTAAATTCACCAATCCCCTCTTTGAAGCCAA 

CAACCTTGTCATCTGGCCTTCAGTTGCTCCGCAGAGTCTTCCACTGTGGGAAGGTATTTT 

CCTACGTTGGAATAGATCCTCTAAGTATTTGGATGAAGCATATGAAGAAATGGTTAACAT 

CATTGAATATAATAAAGAATT 

Sequence 96 

CCGCGTCCGGTTTNCCTGTTGGTTAGGCTGGTCTTGAACTCCTGACCTCACGATCTACCC 

ACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCCTGAGCCACTGCACCAGGCCCACCCTG 

TCTCTATTTTCTAAAATAATAAATCTGATTTTAATGTGGCTGGATATAAATCATATCACA 

GTTGGATTTGGAAGTTTGGGTTTTATTCCTAACTTTGATGGGMGCCATTTTAAGCAGAA 

AGATGATTTTAAAAGACCACTATATTTTCTGTGTGAAGAATGAACTGGGAGATTTCATAG 

TATTATTAACAAAAATAGAATAGTTGGGGATCTGGTTTGGCTTGGGAAATGGAGGAAGTT 

CAACTTTGGGCATGCTCCATTTGCATTGCCAAGACATTGCAGCAATTGGAAGTGCAGTCA 

GAGAGCTTAGGAGAAACACTTGGCAGATGGACATAGAGAAGTAGTACTCAAAGCTTGTGG 

ACATTGATTAAATAATCATACAGGAGTATGGGCTGACAAAAGATTNCAAAGAGAAAACCT 

Sequence 97 

GTCNCCACGCGTCCGGGACTCTCGGCCCTGGAGAAGGAGGTGGACTTTGACTCCGACCCC 

ATGGAGGAGTGCCTGCGGATCTTCAACGAGTCCACCAGCGTCAAGACGGAGGACAGAGGC 

CGGCTGGCCCGGCAGCCCCCCAAGGAAMGAGTGAGGAGAAGGGGCTTTCGGGTCTGACC 

ACTCTGTTCCCCGGGCAGAAGAGGAGGATCTCCCACCTTTCCAAGCAAGGCCAGGAGGTG 

GAGCCCCCGAGGAGGGGTCCCGCGGTGCCCCCGGCCCGGCCCCCGACGGCGCAGGAGGTG 

TGCTACCTGCGGGCCCAGCAGGCGCAGAGGGCATCGGCGAGCTTGCTGCAGGCCCCCGCC 

AGGCTGGCAGAGAAGTCGCCCTNCGTCCACATTTCCCGCCCCTGGCGAGAA 

Sequence 98 

CGCCNCGCGTCCGGCAAAGCAAAAGGGAAATTATTTGGTGGATGGTAGCTCAAAATTGGA 

ACTCTTGTTCTMTTCAGTTACATTGGCTTTACCCTCCTTAGATTTTTCATCAAAGGGCT 

GTCCCATTGCMTCTTACTAAAACATTTTGTTAAAATAMCTCTTTTCCTTTTTATATTA 

ATMTTAGGCTTTTAAATAAAGATGTTATTCCTTTAAAATGGTGGGCTTACCATCATTGA 

AGATGTCACTCAGGTGGCCTTGCTTGATCAAAACGCCTTTTTTAAAAACCAAGCTTTAAA 

MCATGTTTATMTTTCATGAAGTACATATATATTGTTCCCATAGTCTTCAGCTTTAAAA 

CTATA MTATGCCCAMTTTTGTTATTTGCCCTACTTTMGTAGGTTTATTGNGTTTGTT 

TTTTTCAAGTACTTGTTTTTCTCTG ATAAG ACTCAG GAATTCTG AAATGTG AAAATG NCT 

CAATT 

Sequence 99 
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CNCGCGTCCGAMTCGTTGCTACCAANTATTCAAAACCCTTTGAGTTTACATACTAGTTA 

CCTTAAAAATTANTNCCTGACNCTCNTGANTTTGGGNGGAAAGCCCTTGTNTCNNCTCTC 

TNATGNACTCTCATGGGTTTTTTTGTATGATTTGMTATNAATGTGCCTAAAGMTTTTT 

GCTCTCTTAATCTATGNATACATACTTGAACAAATCATTCTTGCTTAACTGCTGATCTTT 

TGTAAAACTATTG 

Sequence 100 

GCCCCGCGTCCGGCTGGAAGCAACAGTTTGGCAGCCTGGGGTACACTCAGGTTATTCGTT 

ACAACTATTATTATTTG ATGTC I I I I I I I AAACTCAGGTCATCCACTTTTGACTGTCATC 

CATGGMGAGCTCTTATTAAAAGCCTCAGACTTTCGGGACCTATGATTCTTTGGCACAAC 

CTTTTGGAAAATTCTTAAGCAGGGATGAAGCAAACTTGATTGGAGTTGGGGAAAAAGAAG 

ACAGATTAGTATTTTTCATGCTGACAAAAAATAGCTGCTATGACTTTTCCCGCAACGTGG 

ACAG GGGC CAAGTGAAGCTGAACTGGTCATGGTCTGTCGCCCAGTGTTCCCTTGTCGTCG 

GCGATTTTGCCCCCGACCCTTCTTGGTGGGCTTAGTGGTGGCAATCTGTCTCTTCTACCA 

GACTCTGACCCTCCGAGGGTCGAGGAAGCTCACAGCCGCTGCCCCTGGGGCTGTCCCACA 

CACATCCACTGAAA 

Sequence 101 

CCACGCGTCCGGGCGTCTCGGCTCTTCTGTATCTCCCTGGCCTGGTCGCTCTCGGCTTCT 

GGGCCTCGCCCTTCCTGTCTGTGAAATGGACTCTGGGTGAATCCAAATGGGATCGTCTCG 

GGCTACGTCCTGTCCCTCCGGGACTACAAGTCCCAAGGTGCTCGAGGCGACCTTGGCTCC 

CCCTCCCCACCGGGACCCGCCTCCCTCCAGCCCAAGTCACGTCGTCTAACCTGTTCCCAG 

CTCCTGCCCCGCCCCGTTCTCCGCTCCCCAAGCCGGAGCCCGAGCTGGAGGAAGCCCCCA 

GGTGCCAGGATCTGCTCGGATCCGNGCCCGCTCCGGCCGGCACCATGGACAGTGAGGCAT 

TCCAGAGCGCGCGGGACTTTNTGGACATGAACTTTCAGTCGCTGGCCATGAAACACATGG 

ATCTGAAGCAGATG 

Sequence 102 

CCACGCGTCCGGTCCGGGGTGGAATCACGTCGCTGCGGCTGCCGACGACCCACACCCGGC 

CGGCCGCCTCCGCAGACCCACCTTGGCCGCGCGGCAGGGGGCGCGCAGAGCCCCGAGGGA 

GCGAGTCCCCGCGCGTGGCAGCTCGGCGGCTTCTCCCTTCGGGAGGTCCGGCTCCCGGCT 

CTCCGGACCCGCCTGGCGTCCTCGCCTGCGGCGGGGCGGACGACAGCGGCGCCCAGGAAT 

GGCTTCGGCGGGCAGCGGCATGGAGGAGGTGCGCGTGTCGGTGCTGACCCCCTTGAAGCT 

GGTCGGGCTGGTGTGCATCTTCCTGGCGCTGTGTCTGGACCTGGGGGCGGTGCTGAGCCC 

GGCCTGGGTCACAGCTGACCACCAGTACTACCTTGTCGTTGTGGGGAGTCCTGCCGGAAA 

CCCGCCAGCTTGGACATCTGGCACTGTCGAGTCCACGCTCANCAANCGATTGGCAGATTG 

C 

Sequence 103 

NCGCGTCCGAGAMTTGCMTTTTTTMTTTTTAATTTTAAGAGGAATTCGTGCCAGAGA 

GAACTATTAAGAAAGGGGTATATCCAGTCTAAGGATTATTAGGCTCAAGTCCATGAATAG 

GCTCTGGGMGTTTGTAAACACTTGGAAATTATTTGCAAAATGTGTGTGTGAATGTGCTT 

TACCTTANAGAGTTCATGMTTTCATTAGATTGTTGAMGAGTTTTAGTATTAACAAAGG 

AAAAACAAACCACCACCATCACATAACAAAACCACAACAGTGATTTAATCTTTTCACCTA 

ACAATAAGTAAATTGAGGCTCTGATGGCTAAATTAATAGCCTGAGGCTACACAGTCAGTG 

GCAGAGCCCAGGGTANAGAGAGAACCAGCACAAGCCATTGTGGGAGCCGAGGGTAAAGAG 

AGAGCtAGGTGTTGTACCTTAGTAAATAAATCAGAA 

Sequence 104 

GNGTCGCCCCGCGTCCGGAAGGTGGAGACCGCTTACCCTGATCNGGGATGTATCGGCTGC 

GGGTGCGCAAGGCAGTCCAGGAGTGACCTGGGGCTGTGGAGAGCGACCCGTGGCCTTGTG 

TTTCAGAGTTTACCACCTAGGATGACTTCAGTGACTAGATCAGAGATCATAGATGAAAAA 

GGACCAGTGATGTCTAAGACTCATGATCATCAATTGGAATCAAGTCTCAGTCCTGTGGAA 

GTGTTTGCTAAAACATCTGCCTCCCTGGAGATGAATCAAGGCGTTTCAGAGGAAAGAATT 

CACCTTGGCTCTAGCCCTAAAAAAGGGGGAAATTGTGATCTCAGCCACCAGGAAAGACTT 

CAGTCCGAAGTCCCTTCATTTGTCTCCTCAAGAACAATCTGCCAGTTATCAAGACAGGAG 
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GCAATCCTGGCGGCGAGCAAGTATGAAAGAAACGAACCGGCGGAAGTCGCTGCATCCCAT 
TCA 

Sequence 105 

CGTCCGCGCAGCGCTTGAATCCCGTGGCCTAACCGTCCCTCGGAAGACCGGTCCCGCTCG 

GGAGGCTCTGCAGTCGCGCCTGGGGTCAGGGCCGGGGGCGAATGTGGCTCGCGTTCTAGG 

CCTCCCTGGGTTGGAAAAAGACTATGTTAGCAANGTGTCACGCCATGCTTTTGCCAACTT 

TGCMTTAAAGGTTGACATTCCTGCATMGCATTTCTCTGTGAAAATGTCCTTGCCTCTT 

ACAGAGGAGCAGAGGAAAAAGATTGAAGAGAATCGACAAAAGGCTCTGGCCCGCAGAGCT 

GAGAAGTTATTGGCAGAACAGCATCAGAGGACTAGCTCGGGCACCTCCATTGCTGGCAAC 

CCATTCCAGGCCAAGCAAGGCCCATCCCAAAATTTCCCAAGGGGAGTCTTGTAAGGCCAA 

GTGAGCCATTGGTGTCATTTTCAAGCAACAGAATCTCAGTAGCTCATCTAATGCTGACCA 

AAGACCTCATGATTCCACAGTTTTCANGCAANGGGAATATGGAAA 

Sequence 106 

CCGGCCCTATCCCTATATTGTTTGCTTGTTTGGGATAACCTAAAA I I I I I IATCCAGTTT 

ACTACTMTTTGTTTTACCTGATGTATCTTCTCTTTCAATAATTTTATGTTACCTTCTGT 

TTAGAATAATATTTGCCACAGATATTTAGGTTTAATTCTGTGTTTGAATGATTCCAATGC 

CTTT CTCTACCCA CTTTGAACACTTCATCCTGGAATGGTTGGCTGATGTATGTCTCTAAA 

CAA I I 1 1 I I I I I IAGGAGAAGGTATGTGGGTAATGTAATTCCTAAACCTTTGCTTTTCTG 

AMAATCTTTCATTTGCCTTTATACATGACCAGATTTACTGGGTATATAGATTTGTTGAT 

GAAAAAAGGTAAAAAGAGCAACTTTTGACATCCAGAGGTTTGTCTGGCACTCACAGCTAG 

CCCCGTGTTATTCTCCCTATT 

Sequence 107 

GCGTCCGTCTNAACCCTAAAGCTAAAAAGTCATTGTGAACCTTTNGGTCTGATGCTAAAG 

AAGGGAAAACAGGTACAGGAAATCCCATGTGGATGCTTGCTTNCAGGATTTCCTGCCATG 

ATTCCAGAATCCCACAGCTNCAACATGATTGCAAAAAGACTCCCTGCTCATnTNCCTCA 

GCATGCACAGCGCTGTCCTGTCTCAGTTGCAACTCGACAGAGCCGCATTTACTCCAGAAC 

CCAATCCACACACCTGCTCATCCTGCCCCGAGAGGAGTGCCTGAAGCCAATAGCAGGGAA 

CTAGAGCAGACTTGGGTGGATCTTCATTGGATATTAGGTATCTTGCCCTAGATAGGCAAG 

CAGTGGCCTTACAGATGCTGACAGATGATCTGATTAGATGCACAGNTGCTGGGTGGCGTC 

TGGGGCCAGTCTATTGGNCAGTTCTGGGAGNGGGAACTATTTGGGCTCTGCAAAGATG 

Sequence 108 

CGTCCGCTCCCTGGCCCTGCTCCGGGAGCTGTCGTTGTCTCCGCCAGCAGCCCTGTGGCT 

GCAGGAGCGCCAGGCCCAGCTTCGCCACTCGCTGCCCCTGCAGAGCTTCCTGCTGAAACC 

TGTCCAGCGCATTCTCAAGTACCATCTGCTGCTGCAGGAACTAGGGAAGCACTGGGCGGA 

GGGCCCAGGCACTGGGGGTCGCGAGATGGTGGAGGAAGCTATTGTGTCCATGACAGCGGT 

TGCCTGGTACATCAACGACATGAAGCGCAAGCAGGAGCATGCAGCGCGCCTCCAGGAAGT 

GCAGCGGCGGCTGGGTGGCTGGACCGGACCAGAGCTCAGTGCTTTTGGGGAACTGGTGTT 

GGAGGGCCGCGTTCCGAGGAGGCGGANGGGNGGGTTGGCCCCCGGCTACAAGGGGGT 

Sequence 109 

AGAATTGTGTATGCCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCA 

AAGAAGATGGAACCAGACTGGAATTCTGTCTCCAGAGAGAAACCCAGCTGTTTGGGTCAA 

AGACAGATGCTTCAGACTTGGGTGGGAAGGTGAAAGATGGCTATTTAGAAAGCTGGTGGC 

ACGTTTTACATAAGGGAATGTCAGATGGGAGATGCTAGTTGCCATTTTAACAAAGCAGGT 

AAATCGGTAAATTTTAAACTCTGTCCATGTTCTGTTAGAACTCAGGGACAAGGGATCCAT 

GAAAAAG 

Sequence 110 

ACGCGTCCGCACGGAGAGAACTGGNCCTGGAGCGGGGGCGCGGGGAGGGGGGCGTCGTCN 

TGGGTACMTTGCGCANGGGCAAAGGTCGAGAGGTCGCGCTGNCGCCGTTTTATTTGAAG 

ACATCGTCCAGTTCTGACCATGGACTCNCAGCCATCGGCCCTTAGTTTCCATTCCCTCTA 

G NGGG CCTTCNGAGGGNTCTACTGACGTACCTCCTTCCCTTGGTACCGGACCGGGGAAGT 

GTTTTCGGGCGCGGGAGGTTCCGCATGCCCAGGCCTGGCCAGGGGA 
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Sequence 1 1 1 

CGGGCCCTTAGTCCAAGCCTTGATCGGCGACTAAGTGACGGCAGTGACTGCCGCCATGCC 

GAGCTGGACGGAAGNCACTTCTGAGAAGGGCGGAAGTGTCTCGGGCTCCTTAGAGGGAGG 

ACACCATATTAGTGCCAGTGGGGAAGTCACCGGGTGGAATTACTTCTTTGTGGAGTTTGT 

GCTGTAGCGACAATGAAAAACGAAGAGTCAACTTTTATAAAACAAAATAAAMTTAAGTC 

AAATCATGCCAACCTTTATTAGATCGGCTAGCAGGGTTAAACTTAATTCAAAGCCCCTGA 

TGAATCGGGCCTTCATTGCACCCCCAAAGGCTCCGCCACCCTGATT 

Sequence 112 

CGCGTCCGGGCGCCGGTACGCCTGGTCCCCGCGTGGAGTCTTTACTCAAAACAGCTCCCG 

CCTCAGGCCGAGATGAGGAGCCCTTCANAATAGCTGCTGTCTCTGGGNGGACCCGGGCGT 

CCTTGGCAGCCCAGCTGNTCTGGACAAAGCCCTGCCAGTCAGGCCTCGGCTGGCAGGAAC 

CATGGGAGAGGCTGGGGATGCTGCGCTATCGGTGGCCGAGTGGCTGCGGGCATTGCACCT 

GGAGCAGTACACGGGGCTCTTTGAGCAGCATGGCCTGGTGTGGGCCACTGAGTGCCAAGG 

CCTCAGCGACACCCGCCTGATGGACATGGGCATGCTACTCCCT 

Sequence 113 

TGTCGACCCCGCGTCCGCGGGANGTTCATGGAAACGCAGGACACGACAGAATTGTGTIMTG 

CCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCAAAGAAGATGGAAC 

CAGACTGGAATTCTGNCTCCAGAGAGAAACCCAGCTGTTTGGGTCAAAGACAGATGCTTC 

AGACTTGGGTGGGAAGGTGAAAGATGGNTATTTAGAAAGCTGGTGGCACGTTTTACATAN 

GGGAATGTCAGATGGGAGATGCTNGTTGCCATTTTAACAAAGCAGGTNAATCGGTNAATT 

TTAAACTCTGTCCATGTTCTGTTAGAACTCATGGACAAGGATCCATGAAAAAGACCTGTG 

ATGTTTCNTCTGGCGCTTTACTGGCCTGGGCACACCTACCAATCTTTTAGGATTTGACTG 

GTTCCATTACATTTCCT 

Sequence 114 

GTCGACCCCGCGTCCGTATCACTGTAATTTAAGGAAAGAAAACTTCAGTTCTGCCTCTGG 

ATACCAAGATGCCCATTGCTCAGTTCAGACAACTGATATTAAAATAAAGCTATGCTCCTT 

ACTTACTTCTTTTATTATAAACAAATTCCTTTGCTTTGGCTGATACTAGCTGAGTCATTG 

ATCATCATTGGTACCATGATATTGTAATCTATGCTGCTATTTGGCACAAGACTGAAGTTC 

ACACTACAGTAGAGAATACTATAAGATAATTTGCAATAAATACTGATAATAATAATACCA 

GATATTTTMCTMCTTTTTCTACC'TTTATTAATAGCAATCAGCACACTTGAATGTGTAA 

ATTTCACAGTAACTTTAGGCAGAACTTAAGCTCCAGGCCACATTTGTATAAGMCACCAA 

GTATTCAAGGCATAAAGTCTGTTGTAAGCCAAAAAAA 

Sequence 115 

AGTTCAGTCTGCAGCAGTCCCTGCACCCACTTCCCAGTTGCTTTCATCTNTGGAAAAAGA 

TGAGCCCCGTAAAAGTTTTGGCATCAAGGTCCAGAATCTTCCAGTACGCTCTACAGATAC 

MGCCTTAAAGATGGCCTTTTCCATGAATTTAAGAAATTTGGAAAAGTAACTTCAGTGCA 

GATACATGGAACTTCAGAAGAGAGGTATGGTCTGGTATTCTTTCGGCAGCAAGAGGACCA 

AGAMMGCCTTGACTGCATCAAAAGGAAAACTTTTCTTTGGCATGCAGATTGAAGTAAC 

AGCATGGATAGGTCCAGAAACAGGAAAGTGAAAATGAATTTCGCCCCTTGGATGAAAGGA 

TAGATGAATTTCACCCCAAAGCAACAAGAACTCTNTTTATTGGCAACCTTGAAAAAACC 

Sequence 116 

CCCGCGTCCGCACCAGGCCCGAGTCTTCCCTTCATGGAGGGTGACGTGAGCAGCAAGGAT ' 

AAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAGCTCTTCAGGAAGCTGCAGCAAGG 

TTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGACCGAAAGAATGAC 

AAGAAGCCACCACCTTATAAACATATAAAGGGTCTCCCTCTGTGACCCAGGCTAGAGTGC 

ATTGC.TGCAATTTTGGCTCACTGCAACCTCCGCTTCGTGGGCGCAAGTGATTCTCCTGCC 

TCCT GCCTCAGTCTCCTAAGTAGCTGGGATTACAGACATGAGCCACCAACGCCTGGCTAA 

TTTTGTGATTGGCAAAAMGAGATTTTGTGACACATAAAGATGATATGAAATTCAACTTT 

CAATCAAGTATCCAGAAAATTTTA 

Sequence 117 

CCACGCGTCCGGCCCTTGCCCCTGTCNCACANGAATGGACCCACGGCCCCACCCAGCGCC 
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GTCAGCGCCCGGCACTGCCACCCGGGTCCGGGCCGCTGCCTGCACGTGGGATCCGTCGGG 

CAGCCGGGGACAGAAGAGACCCCGCCGTTGGGACGCAGGGCAGAGCCGGCCACCTAGTCC 

CTTCCAGCCAGCAGAGGCGAGGGAAGGCGTCACTGCCCCGGCGGGGAGACGGGCAGGACG 

CCCTGCCCCGCACCAGCAGCCTCCGCCGGGGCGCCCTCAGCTCCCTGCTTGGCTCTGTCT 

CTCCACACCCGGCAGGGCCCGCGGGCTGCCCCAGCCTGGGGGGGTCGTGGGCAGCTGCTA 

CTCAGTGCCAACCCCGTGGGGCACAGA 

Sequence 118 

NAGGGAGTCGACCACGCGTCCGGTGCGGAGCAAGCATCACACCATGGCGTATGAGTGTTC 

CTCTGTGTAGACTCAACCTGCGCCTCGCCGTCCCCCATTCGCACACCCGATGCCCGGGGG 

TCGCTACGGACTTAAAATCTCCGCACCGCACCCTCCACCTCAGAAACGTTCCTGGATCCG 

AACACTGCCCCCTGACGACCTAGAGAGATCCCGGCTCCCAGCCCACTGAGTGGCTTCAGC 

CTCGCTGGTAGGTCTCTCCTCCAAAGCTCTGGAAACAGACTCCTGGGAGTGANGGTAGNG 

GGGGAGCNGCAGGCACCGCCCCCTTTCCCCAAGTCCNCCGCCCACTTCATCCCTCAGGCA 

CCTNCCAACTCCTGGCCTTNTCTGCACGAGGCGCCTGCCCCGGGCCCCGCCTACAGGGGA 

CCCAGCTCTTCTTGCAGCCATTGGAAGNTGATCACCTGGGAGGTGA 

Sequence 119 

CACGCGTCCGGTTTTACTGCTCTTTGCCATGTGGTAAAAAGAGGCTGAGACATATTTAAG 

AATTCCAAGAGGATATTATGTGTCAGAATTTCAGACACTGATGAGAAGI I 1 1 I AATTGTT 

CI I I I I I ATTTGATTTTGGAATTCAGGTGCACTCTATTCAAGTGCAAGGATATCAGAAGT 

I I I I I I 1 1 ATTTAAAAAAI I I I I I I I I CGAGATGGAGTTTCACTCTGTTGCCCAGGCTGG 

AGTGCAATGGCAGCTTACTGCAACCTCCACCTCCTGGTTCAAGCGATTCTCCTGCCTCAG 

CCTCCCCAAGTAGCTGGGGATTACAGGCACCGCGCCAACACACCTGGGCTTATTCTAATT 

TAAGTAAGAAAATGGGAAGTCTTACCCATNTTTGGTCAAGGCTTGGGTCTTCGAACCTNC 

TGACCTTAANGGTGATNCCACCCCANCTTTGGCCTCCCAAGCCGTGCTNGGGATTATAGG 

GCATGMGCCCACCCANGCCCGGNCCCAGGATTTTTATATTTAAGCCCTTCTTGCTCTTN 

AAAAAAAAAAAAAGGT 

Sequence 120 

NGTCGCCNCGCGTCCGGGAACCTACCGGTACCGGCCGCGCGCTGGTAAGTCGCCGGTGTG 

GCTGCACCTCACCAATCCCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTC 

TCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGG 

GGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTMTATTTTTGGGGACAGATTTGTGA 

TGCTTGATTCACCCTTGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAA 

CTGGAACCAGCCAGTGAATCTTAAATGNTCACTTAAATCAGAACTTTGCCI^TTAANAAAG 

AAAATTGGGGNGTCCTGGGTTTA 

Sequence 121 

CCNCCCCGCGTCCGATCAATTCCTGG AATTTATGGTTATAACTTCGAAACAGAAGATGGA 

CTAATTTTA I I I I I ATCTA ATTTT ATTGTTG G A AATTT AG G AATTT ACG G A AAT ACT A AT 

TTAMTTATTrAAMAGATCATTAGAATCMGTAAACCAATTTTGATGGCTATGATTGAC 

TCAACMGAGCTAATrATCCAGGTAAAACAATAGATAAAATTTTTGCTAAAMGTTTTTA 

GMAAMCATTTTTAACAACAAAATCCACTTCAAGAATAATTGTCGGAGCATACGATGM 

GAGATGCTTTCAATTCMGAMTCCTTGATTTAGCTTACAAATCAAACGTAAAGGTTGCT 

GTATATGGNAGAAATTATGACMTCTTTTAGAAATGAATCAACGATTAGCACAAAAACAA 

AATCTTGAAATACATTATCCAGAA I I 1 1 I I GATTTCAGGCAAGCTAATAAAATCGATAAT 

TTTGTMTCTrMTTACATCMCACCTGGAGCGMTTTACCAMGATTTTTTAGAAT 

Sequence 122 

CGCGTCCGCGAAACTGAGAACCAGTTCTCCCGAAGCCGCGGGTCTCCGGCCGGCGGCGGC 

GGCGGCGGCGCAGGTGAGCAGGGCAGGGGGCAGCCGAGGGAGCGCGGGGAGCGGGGGCCG 

GGGGGCCACGTACGAGGGGCTGCAGGCCCAGCCGGGGCGGGACTCGCCAATCCTGCGTCC 

CCAGCTCAGGACGCGGACGCTGATCCGAAGCCCCTGGCCCCGGCTGGGTCAGCACTGGGA 

GAGCAGGCCCAGGTCCGCAGCCCGGGTGTGGGGCCCTCCCCAAATCCAGGGAAAGGATCG 
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TGGAGCGGGGTGGGGACTGAAAGCCATTTCCTTTCCCGTGAAGAAl 1 1 I I ATCAGTGCAA 
GTAACAAAATATT 

Sequence 123 

CGCGTCCGCTNAAAAAAATAATACCAAAAAAAGTTTTTGTAAAGACAACGCTCTCGCTGT 
GTTGCCCCGCCACTGTGGCCTCCTTAGCTTCTTCCCTGGGGCCTGCTGGACCTT TCCATA 
CTCCAGAAACTAAAGGGGGTCCAGGACCCTGCTTNAACCCTAGGATCCCGCATCl Mill 
1 1 1 1 I I I I I | I GGACGCAGGGTCTTGCTGTGTCCCTCAGGCTGGAGTGCAGTGATTCACT 
GCAGCCTCAAACTCGTGGGCTNAAGTGATTTCTTTAG CCTCAGC CTTNTAAGTAGCTGGG 
GACTACAGTCATACACCAACATGCCCAGCTAANTTTCCTTTTTTTAATTCTTGTAGAGNA 
TGTTTGAGACGGCTTGGGCTNTGTTGCC 
Sequence 124 

CCNCGCGTCCGTGCTGATAAAACTCCTTTGACCTGACGATTGCT CTAAGTC CTAATTGCC 

ATATTTATATTCCCATAGTAAGAGTGTTTGGAGATAGTGTTTGAGCI I I I I I GCTGGTGT 

TAAAAATGCATAATGAAAGATGGCACNAGAGAGGCATATTATATCCAATTCATGAAGTTG 

TTTGTGTTAACAGAAAGCTTATTTTMTCACTTMCATTGTTGATTTGTCTAATCACAGT 

AGCGCTATTGATTAGGAGCCTGACCTTTANATGGTTGACTTGTGAGTGTATTCAATATGG 

TGAMTAANGGTGTTTGATATATGGCTGCAGATTTTAGAAGGTGTCATTAGCAAAGGTAT 

ACGGAATAAAATANGGGTTATAGTATTCCTTACTCAAATTCTGTATGTGCTAGAGCTGGC 

TGGAGTCTGTTGGCATGCTCATTTGGTGTAAGGNCCGNTAAGGACTATGCT 

Sequence 125 

GCCCCGCGTCCGCACTTTGTATTGATAACTTAAAATGGCATCAGTTTATCTTAGACATCA 

GCTTGCTTTTTATCTCCTTTTTTAGTGAGTGAAATAGAGCAACTAGCATGCCTGTGTTCC 

CAGCTACTTGGGAGGCTAAGGTGGGAAGATCAATTGAACCTAGGAGGTTGAGGCtATAGT 

G AGCTGTGATTGCACGACTGCACTCCAGCCTGGGCAATGGAGTGAGACTCCTGTCTCTAA 

AACAGCAACAACAAAAATAAAGCAACCATAGTGCATAAGGGAAA TTAAA TGTTCCCTATA 

GAAATATGTGTATGTCTGTGATAAGTGGTATGCAAATGCTAATTATTTTATAAAATAAAA 

GTTCAGMCTATTCTTATCATTGCCACTTGAACMTTAMGGGTTTGCTTTATTTCCTAA 

TGTTTAATAGGAACCCTTTGCTTCAAACAGCCTTTGTTGAAATCATGTAAAAATTTGTTA 

ATAG 

Sequence 126 

CNCCACGCGTCCGGCGGCCCAGCCGCCGCCTCCCGTTCTCCCCTCCGCAGCGGCGGCGGT 

GGCGGAGAAGGAACTCGACACGCACCGACCGCCCTCCCGCCCCAGCCGAAGCGGAAGCTG 

TAGCCCGCTCTGGGCCGGGGCCATGGGCGCCCCGCGCCGCCCGGGTCATGAGGACGGAGG 

CGGAGGCAGCGGGGCCGCCGCTCGAGCCCGGGGACTTTGTGCAACTGCCTGTGCCCGTCA 

TCCAGCAGCTCTACCACTGGGACTGTGGCCTGGCCTGCTCCAGGATGGTGCTGCGGTACC 

TGGGCCAGCTGGACGACAGTGAGTTTGAGAGAGCCCTGCAGAAGCTGCAGCTGACCAGGA 

GCATCTGGACCATCGACCTGGCCTACCTGATGCACCACTTTTGGCGTGAGGCACCGCTTC 

TGTCCCAGACCTGGGGTGTNGACAAGGGCTACAAGAACCAGTCCTTCTACAGGAAGCACT 

TT 

Sequence 127 

CNCGCGTCCGCGGTCGCGTGGGCGGACGCGNGGGTTCGTCCTGGACAAGTCTGGGAGTGT 

GGCAAATAACTGGATTGAAATTTATAATTTCGGNNAGCANCNGGGCGGAGAGATTCNGTG 

AGCCCTGAAATGAGATTATCTTTCATTGTGTTTTCTTCTCAAGCAACTATTATTTTGCCA 

TTAACTGGAGACAGAGGCAAAATCAGTCAAGGCTTGGAGGATTTAAAACGTGTTANTCCA 

GTAGGAGAGACATATATCCATGAAGGACTAAAGCTAGCGAATGAACAAATTCAGAAAGCA 

GGAGGCTTGAAAACCTCCAGTATCATAATTGCTCTGNCAGATTGGCAAGTTTGGACGGTC 

T 

Sequence 128 

GGAGTCACCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGGTTAATCTTAGGCCTGAGGT 

TTGGGGCCGGGTGACAAGGAAGTTAAAC TCGT CCTCCCTGCCAGATTCTACCCCCTTTCG 

GAGCTGAGCTCCAGCCAAACCTGTGGAGTTTTCTTTGACCATTTTAGGACATGTTACTGC 
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TTCTGAGTTGGCTGCCCCAGCTGCTCAAACAAGACCTTTCTCCTGGGTTCGTAGTAGTGA 

AAAGGAGCAGCAGAGCAACTGAGGAGGAGGGCGGGTGGGAGGCATGGGACTGGGGCTTGG 

GGAGGTCAGGCGAGACCGGGGTGAGAGCTCAGAGAAGCTCCTGTGACTTCCATGCTAAGA 

TCTTGCCAGAGAACTCTGGTCAGTCCTCGGGTGTCTGGATGAAGTAAAGGAGTTAGGCAT 

TTCTTCCTTTGATTCTCTGGCTTACCT 

Sequence 129 

CGTCCGGCCCGCTCCGGGCGTGGGCGTGTTCTCGGCGGGCGTGCCTGGAGGAGGAGCTGG 

GTCCTTGTGCGGCTGCAGAGTCAGATGGGGCGGGGATTTCGGGGCACCGGGTCCTCACCT 

TCACGAGAAAGGCCCCACAGCACGTCCCCACTACCCGACGACTCACTCTTCGTGGCTTCT 

CTCTCCTCCCCAAGAGCAGGGGTGGGCCTGTCTCGCGTTCCCCTGCGGGAGTCAGGAAGC 

GTCCTTCCTACCTACCAGTCCTCCCCTCTGGTGTCTGGGGACACTTCCTGGGGGCCTTTC 

AGGTGGTTGGCGCCGGTGCAGGGCCTGAGAGCCTGGG 

Sequence 130 

GCGTCCGGTGGCATCATGACTTCTGGGGCAGTAGACTGAGCAGCAACACCAGCCACAAGT 

CCTACCGGCCTCTCACCGTCCTGACTTTCAGGATTAACTACTACCTCTCGGGAGGCTTCC 

ACCCCGTGGGCTTTCACGTGGTCAACATCCTCCTGCACAGTGGCATCTCTGTCCTCATGG 

TGGACGTCTTCTCGGTTCTGTTTGGCGGCCTGCAGTACACCAGTAAAGGCCGGAGGCTGC 

ACCTCGCCCCCAGGGCGTCCCTGCTGGCCGCGCTGCTGTTTGCTGTCCATCCTGTGCACA 

CCGAGTGTGTTGCTGGTGTTGTCGGCCGTGCAGACCTNCTGTGTGCCCTGTTCTTCTTGT 

TATCTTTCCTTGGCTACTGNMAGCATTTAGAGAAAGTAACAAGGAGGGAGCGCATTCTT 

CCACCTTCTTGGGTGCTGCTGAGTATCTTTCTGGGAGCAGTGGNCATGCTTGTGCAAAAG 

AGCAAGGGATCACTTGTGCTGGGTTTAAAATGCCGGMTTTGACAATCTTTGGGTGATAG 

GC 

Sequence 131 

GTCCGCTGGGGGCCCTGGGGCTCTCTGCGTCGAGAGCGCTCGAAGACCCGGGATTCCTGG 

CCCGATCGCGGGCGGGGGGAGACCCCAGCTCCACCCCAGCTCCCGCCGGCTCGGGGAAGG 

GGCGGCCCCTTTAAGAGCGCGCGGCCCCGCCCGCCCCCTCCGGGCAGGATCCGAATTCCA 

GGGAGGCGGGGCGGAGACGGCGGCGAGGAGGAGGCCGCGGCGCGGGACGCATAGAGCTGC 

GGCTCGGGCGGCGCCTCCTGCGGCGGCCCGGCCCGGCTCCGGCCCCCGCTGGGGCAATGC 

TCCCCGGG 

Sequence 132 

TCGCCNCGCGTCCGGGCACACACATGCCAGGCTATTTTAAGAACTACTACAACTATGATA 

AAGCTGTGAATATGTAGCCATGAACCAAAACAAAGTCTCTGTCCTTGTGGAACATTTGTT 

CTGTCAGAGAAGACAGTGTGTTGGCTCACATTGTGGTCAGTGCTGTTGAGCAAAATAGGT 

CAGAGTAAGGGGGATGGAGACTGGTGGGAGGAATGCTGCTTTATCCAGGATGGGCAGGGA 

GGACTCGATGGTGTCAGCACTGAAGGATGTAAGATCTGCTGCTCTGGGGAGAGGAGCAGC 

ATGGAAGGAGTAGAGTGCAGAGGCCATGAGGAAGGATCAGGCTTGACTCCTTTGAGCAAG 

GGGGATGGGAAGAGTGACGGNAGAAAGAGGGACAGGCCACATGGCCTGGTGGCCTGTGCT 

GAGGCCTTGGGCTTTTCTTCAAGTGAGATGAGATGCCATTGGCCAGTTTGGGCAGTGATT 

TNATCAGACTTGGTTCAGCAGGACCATNCTGCTTGGCAATGTGGAGAGCANGCTGAAG 

Sequence 133 

CGCCNCGCGTCCGAACAGGCCGGGCACCAAGGCGCAGGATTTCTATAATTGGCCTGATGA 

ATCCTTTGATGAAATGGACAGTACACTAGCTGTTCAACAGTATATTCAACAGAACATAAG 

AGCAGATTGCTCCAATATTGACAAAATTCTTGAACCACCTGAAGGCCAAGATGAAGGTGT 

GTGGAAGTATGMCATTTAAGGCAGTTCTGCCTTGAGCTAAATGGACTTGCTGTCAAACT 

TCAGAGTGMTGCCATCCAGATACTTGCACTCAAATGACAGCAACTGAACAATGGATTTT 

TCTTTGTGCAGCTCATAAAACTCCAAAAGAGTGTCCTGCTATAGACTATACTAGACACAC 

ACTTGATGGTGCTGCATGTCTTCTGMTAGCAATAMTATTTTCCCAGCAGGGTTAGCAT 

AMGGMT CATCT GTAGCGAACTAGGATCAGTATGCCGTAGGATTTACAGMTATTTTTC 

ACATGCTTATTTTCATCATCGGCAGATATTTGGATGAATATGAAAATGAAAC 

Sequence 134 
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GCGTCCGCGAMGCTGGGAAGCCAGGTCTACCTGCCCCAGACGAATTGGTGTACCAGGTG 

CCACAGAGCACACAAGAAGTATCAGGAGCAGGAAGGGATGGGGAATGTGATGTTTTTAAA 

GMATCCTTTGAAGATGATGCTGCTTTTTACAMGCATCGTTTTAAAGCACATGGCCTTT 

1 1 1 I I I I I AATTATTAGTGGTAGTAATATATAGAATGTATTACATAACTGTCACTGAAGT 

GGTTGGGGAAAATGTGGTGACTGAGGTACAGGAAACTACTAATCTTGCCATCTTGCTTTA 

AGGTGTTATGGTGGCACAGTTACTGCTCGCCTGTTAAATTTCAAATGTCCTGTTTGATAC 

TACTGGAGAACACTATTTTTAATACAGAAAAAGGTCCCTATAATGCACTTCAGAGAAATT 

AA 

Sequence 135 

TCGACCCACGCGTCCGGGAGTCCCCCCTGCCCCCCATCAAATGCTTCCTGCAATACTTTG 

CACACCAGAGACTGGGCCTCCCCAGATCCAGGGGGACAGGGGTCCCTGGGGGAGTCCCCA 

GGGCCAGCCCCTCCAGGCCAGCTGCACACACTTGACACTGATTTGCACAGTCTTGCACAA 

ATAGGGGGTAAGAGCCCAGTGGCTGGGGTGGGCAATGGGGGTAGCCTCTGGCCTAGGGAG 

TCCCCTGGCACTGCCAATGGGCACAGTCCCGAGCACACACCCCCTGGCCCTGGACCCCCA 

GGCCCCTGCCCCACCAAGCGAAGGCTGCTTCCTGCTGGAGAAGCCCCAGATGTCAGCTCT 

GAGGAAGAGGGGCCAGCCCCTCGGAGGCGCCGGGGATCCCTGGGCCACCCTACTGCTGCC 

AACAGTTCTGATGCCAAAGCCACACCCTTCTGGAGCCACCTGCTGCCTGGGCCCAAAGAG 

CCTGTTTTGGACCCAACAGACTGCGGTCCCATGGGGCGGAGGCTGAAAGGAGCCCGTCGC 

GTGAAAGCTTGAGCCCCCTTCGAAAGCCTNCGGAAGGGGCCAGGCCTGCTGAGCCCCCCC 



Sequence 136 

CGACCCCGCGTCCGTGAGAATTCAGCTTTGGAGTCCCGGGTGAGGGGTTTTAGATAAACC 

CATCMTATCACCCACATTCTGTGACTCTrTGCATCACTCGTGTTATTTATTTATTTATT 

TATATTCTGCCTTGTTCCAGAAAAGTGTTTAAGGCAACAACGCTTG I I I I I I GGTGTTTT 

GTTTTGACATTTGAAMTTTAGTACATTGTTAAAATGTACTTGTTAAACAGGTAATTTTA 

AAGAGAAGGAACAATTG I I I I I AGTAAGTTTT C I I I I IC C I I I I I CAATGAATTGATTCT 

TGAAATTAAAAGTTCTTGAGAGAAGGAGAGGAAGATACAGCAGACATAGGACTGAGCCAA 

GGAAGAGTCTGCCTGAGAGAGACGCTTGGCCTGTGCTTTGCTGCCATCCGTGCGGCCTTG 

GCCACA 

Sequence 137 

TCCGATTTTTAAATCTATTGGCCGTGTTGTCCTACCTGAAGTTCTTCAACTGCCAAAAGC 
ACAGCCCTTTTTCTCTGAGCTGGTGGTTCTGGCTAACACTGACAGGGGTCGCTTGTTCCT 
GTGCAGTGGGCATCAAGTACATGGGTGTGTTCACGTACGTGCTCGTGCTGGGTGTTGCAG 
CTGTCCATGCCTGGCACCTGCTTGGAGACCAGACTTTGTCCAATGTAGGTGCTGATGTCC 
AGTGCTGCATGAGGCCGGCCTGTATGGGGCAGATGCGGATGTCACAGGGGGTCTGTGTGT 
' TCTGTCACTTGCTCGCCCGAGCAGTGGCTTTGCTGGTCATCCCGGTCGTCCTGTACTTAC 
TGTTCTTCTACGTCCACTTGATTCTAGTCTTCCGCTCTGGGCCCCACGACCAA 
Sequence 138 

CGACCNCGCGTCGGGAAGGACCCTCTGAGCTATTTTGCGGCATACGGGAGCAGCAGCTCA 

GGCTCCTCGGACGAGGAGGATAACATCGAGCCGGAGGAGACGAGTCGCAGAACCCCGGAT 

CCGGCGAAGTCGGCGGGCGGCTGTAGGAACAAGGCGGAGAAGCGGCTCCCGGGACCTGAC 

GAGCTGTTTAGGAGCGTGACTCGCCCGGCCTTTCTCTACAATCCGCTCAACAAACAGATA 

GACTGGGAGAGGCACGTCGTCAAGGCGCCTGAGGAGCCTCCAAAGGAATTCAAAATATGG 

AAGTCAAATTATGTACCACCTCCTGAGACCTACACCACTGAGAAGAAGCCTCCGCCTCCA 

GAGCTTTGACATGGCAATAAAATGGTCTAACATATATTGAGGACAATGGTGATGATGCTC 

CACAGAATGCTAAAGAAAGCTAAGGCTTNTACCA 

Sequence 139 

CGACCACGCGTCCGGGCTGGCGAGCCCGGCTGAGGAGCCTCTTGGGTCGCACTTACCGCC 
GCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGCGACGCTGAAGCGG 
CCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCC 
Sequence 140 



AGT 
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CGTGTCCGGTGAATGGGAGCGGAACTCACAGGACACAATGAGCCGGGTCACTGATGGCCT 

TGCTTTCTAAGAATCTCACAGTGAGCCCTAGAACTCTCTACGTGGTAACACTGTGTGCCT 

TTTTCAGAGAAGAGCCTATCTTAGATCTTAGCCTAACGTTGGGTCTATTGTGTTGCTGGA 

GAGACCAGCACTGACATTCATCTCAAAGCACATGGTATGTTTGACTCCTATGTTGACTCA 

ACTACCCATCTTGTACTGGGACACTCGC 1 1 I I I I I 1 1 1 1 I I I I I I GAGACGGAGTCTTGC 

TCTGTCACCGGGCTGGAGTGCAGTGGCACGATCTCGGCTCGCGGCAGCCTCCGCCTCCCG 

GGTTCAAGTGATTCTTCTGCCTCAGCCTCCTGAGCATGTGGAGCTCAGGCTGAAGGTGAT 

GTGGCCGCCC 

Sequence 141 

GTCCGATTGATTCTTCTATGATGCGTGTTCATTATACAATACACATTCTCGGAAAGGCAG 

MTTATTTGCTTTTATGATTGTTTTTTGTGACCTMGAAACGGTTCTTCCCCCATTTTCC 

TATCCCAGGCCAAAAAATATTCTCTCACGTTTTTCTGTAAGAGCTTTATAAGTTCAGCTT 

TTATATTGAGGTCTGTGATTCATCTTGGATTATTATGTGTAGGTTGTAAAATAGGAATCT 

AGGTCCAGTTTTTCGATGTGGATATTCAGTTATTGTGGGGCCAATTGTTGAAAAGTCTCC 

CCAAAGAACTGCTTTCTTATCAGAAAGATAAGATATATTAAAAGTGTATATCTAAGTCTG 

GGTTCTCTTTTCTGTTCTAATGGTTGATATTTTTATCCTTATGCCAGAACCACACTGTCA 

TGATTGCTGTAGCTTTATAATAGTCTTGMTCAAGTTGCTTTTCAGTTTTGTATTTTTCA 

AMTTGCTTTCGGTATTCTAAGTCCTTTGCATTTCTGCTAAAATTTAGAAGCAGCATGTC 

TACCAAAAGGAAAAAAAAAGCC 

Sequence 142 

TCCGGCGGAAGAAGGTGCGTCCGCGGCTGATCGCGGAGCTGGCCCGCCGCGTGCGCGCCC 

TGCGGGAGCAACTGAACAGGCCGCGCGACTCCCAGCTCTACGCGGTGGACTACGAGACCT 

TGACGCGGCCGTTCTCTGGACGCCGGCTGCCGGTCCGGGCCTGGGCCGACGTGCGCCGCG 

AGAGCCGCCTCTTGCAGCTGCTCGGCCGCCTCCCGCTCTTCGGCCTGGGCCGCCTGGTCA 

CGCGCAAGTCCTGGCTGTGGCAGCACGACGAGCCGTGCTACTGGCGCCTCACGCGGGTGC 

GGCCCGACTACACGGCGCAGGTGCGTGCACCCCGTCCGCACCCCGCCCCTGCAGCCGCCT 

GGTCTCCCCGCCTCCCCTCCTCCCTGCAGGTTTGCGTGGCTGAGGCTCCCACCTCCTGAC 

CTCGGGGGCCGAGAGCTTTGCGAGCTGACCCCGCTTCTTCTGGCTTTGCAGAACTTGGAC 

CACGGGAAGGCCTGGGGCATCCTGACCTTCA 

Sequence 143 

CCCGCGTCCGAGATCCTGTAGGTGAAGTTCTCCTGTGCTCCACAGCCACCCAGAGGAATT 

CCAAAACCAGCAGTGGAGGACTTGGGGAGGACAGGAGGGAAAACATGGCGAGTTCATCAG 

CTCTGTTTCCTTTATTAAAATATTTCTGTAATTGGTGGTGGGAAATTGAAGAAATCAAGT 

GATTGCATCAGCGCTGGAAAAAGCTGCCAGCACTTGGCAGTGGAAGAGAATATATGCTTT 

ATACTGGACTTTTTGAAAAAGAGGCTGAGTTTGGCCAGATTGCCGACCAGCMTGGAAAA 

ACTAATTAGGTGCCTTGCCTGTGAGCCAGACGCCCAGCAGGGCTGTGGCGCATGGCTCCC 

GCCGCCTCTGAAGAGGACACTTTCTAGTGAATTCAGTTCGTGCTACCCTTGAGCAGCCTG 

TGCTACAGCAGGCACATTTGTGAATCTCCCAGCCTGTGCCTGGCGTCNGAACTGTAGCTT . 

CCCAAAGAC 

Sequence 144 

CGCGTCCGAGTAAATCTGTTCTGCACTAAATTATATCAAAATAAATATAAATAATTATTT 

CGTATACCATTCGTTATATACATTTCAGTGTTTTTMTAATGTCAATTTCTTATATTTTA 

AGGAGAACTCTAGTTATTTTTATMATCTMTTGACTTAATTTTGTGGGAATATAAAAGA 

AGTGATTAAAMCCTTTGGTTTAAAGTAGTTAATCCTGAAATCAAGCTCTGTAAATATTG 

TGTAGGGATATGGAGAAATCCTCAAAAAGAAAGAGCTAAAGAAAATGGCAGGGATGGCAT 

CTTGGGAGTATAACTGAAAGTAGGAAGATGTGGATAGAAGAGTCTTATTTTTAATCACAG 

GGCATATGTGCTATTTGAATTATTTTGACAAAAGTATAAAAATATGGAATTATGCATTGT 

GTGTGTGTGTGGTTTCATCTGTTTAAGAAATATGTGATGGAGACTGTCCTATCATCAGGA 

AATTATTCCAGT 

Sequence 145 

GTCGACCNCGCGTCCGCGGAGACACCGACCGCGGCGGCAGCAGCAGCAGCAGCAGCGAGA 
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GGCAGAGGCGGCGGCGGCGGGGAGGACAGCACGGCCGAGGCTGCCCAGAGGCGCCTCCTC 

CACACCCCCCGCCGCAGCAGCACCGGCGACAGATTTTTTAAAAAATGGATTTGGCCAACC 

ATGGACTTATTCTACTGCAACAGTTAAACGCTCAGCGAGAGTTTGGTTTCCTGTGTGACT 

GCACGGTTGCAATCGGCGATGTATACTTCAAGGCACACAAATCAGTTCTTGCTTCATTCT 

CCAATTACTTTAA 

Sequence 146 

CCACGCGTCCGATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAACCATGTCCAGCCCTACA 

TCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGGCTTCACTGAGGTGCGAG 

ATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCATCAACGAGGGCGGCATTG 

ACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTGGCGTACCACCCCCTGAAGATGC 

AGAGCTGCTATGAGAAGATGGAGTCGCTGCGACTGGACGGGCTGCAGCAGCGATTTGATG . 

TGTCCAGCACGTCCGTGTTCAAGCAGCGAGCCCAGATCCACATGCGGGAGCAAATGGACA , 

ATGCCGTGTATACGTTCGAGACCCTCCTGCACCAGGAGCTGGGGAAGGGGCCCACCAAGG 

AGGAGCTGTGCAAGTCCATCCAGCGGGTCCTGGAGCGGGTGCTGAAGAAA 

Sequence 147 

NACCACGCGTCCGCCNCGCGTCCGCTTGCACCCGGTGAAGAGCGTGCGTGTGCTGAGGCC 

GGAGCCGCAGACGGCTGTGGGGCCCTCGCACCCCGCCTGGGTGCCCGCGCCTGCCCCGGC 

CCCCGCCCNCGNCCNCGCCCCCGNCCNNGCTGCGGAGGGCTTGGACGCCAAGGAGGANCA 

TGCCCTGGCGCTGGNCGGCACAGGCGCCTTCCCGNTGGACGTGGAGTAC 

Sequence 148 

TCCCAAGAGCTGCANGNNNCAGCCGCGACAGCAAGAACCGGNAGAGCCGGCAGACCGCGG 

CGGCGGCGGCGNCGGAGGCAGGAGCAGCCTGGGCGGGACGCAGGGNCTCCGCGGGCGCAG 

GAAGGCGAGCAAGAGATATNCTCTGAGAGCCAAGCAAAAGAACATTAANGGAAAGGGAAG 

GAGGAAANGAAGGCTGGATACCGGNGCAGTGAAAAAAGGCACTTCCAAGAGNTGGGGGCA 

CTCAACTACGCCACNAGACTCTGACCGGGTGGCCCAATCAAGCCAATGAAGAAACCTATA 

ACCCGGNTTAACCTNTNATTGGCCTGCCTTCTNNNTNGGGGGGTGGGGGCCCAAGCCCNC 

TTMCCCCCCMCCTTCNNTTTCAAACCTATCCCAACCTTATTCNAAAANGAAGGGGANC 

CTNAAGGATGGGGNTTTCCCCCAAGGCNAAAAAGGAAAAAAAGGGGGCCCCCNNGGGAAG 

CCTTCNTTCTTGGGGAAAAACAAGGCAAAAAAAATTGGAAGCCTGAAAACCCGGCTTCAA 

AAAAAAGGGG 

Sequence 149 

GGCCGAAAGGGGGGCAGGTGGTCGGGCCGCGCAAGCGGAGATGGAATGGGGCCCGGGCTC 

AGACTGGTCACGGGGGGAGGCTGCCGGCGTGGACCGCGGGAAGGCGGGGCTGGGGCTCGG 

CGGGAGGCCACCCCCACAGCCGCCCCGGGAGGAGCGCGCCCAGCAGCTGCTGGACGCGGT 

GGAGCAGCGGCAGCGGCAGCTCCTGGACACCATCGCAGCCTGCGAGGAGATGTTACGGCA 

GCTGGGCCGCCGGCGCCCGGAGCCGGCTGGTGGCGGGAACGTCTCANCCAAACCTGGAGC 

G 

Sequence 150 

CACGCGTCCGGCCTGCTGTTNACCTGCGGGACCCCAGGAACCTGGACTTGTTTCTCAAAG 

TGGTTCATGGAGATGTCACCCCCTACGACCTGGTGCGGATGAGCTCGATGCAGCTGGCCC 

CCCAGGAGCTGGCCCGCTGGCGGGACCAGGAGGAGAAAAGGGGCCTGAATATCATTGAGC 

AGCAACAGAAGGAGCCGTGCAGACTTCCAGCCTNCAAAATGACCCACAAGGGCGAAGTGG 

AGATTCAGCGGGACATGGACCAGACACTGACCCTGGAGGATCTGGTGGGACCGCAGATGT 

TCATGGACTGCAGCCCACAGGCCCTGCCCATCGCATCAGAGGACACCACGGGGCAAGCAT 

GACCACCACTTCTTAGACCCCAACTGCCACATCTGCAAGGACTGG 

Sequence 151 

TTTTTCCTTAGMTCTTCGAGMAAAGATGAAGGTATTATTCTCAGTTTCGAGATCAGGA 

CTCCTCACCACCAGGCGGGGGCTTTAAGGTAGACACTACAGGGAATCTGATCTCAGGGTG 

ATCCTCTCCCTTCCACTTGCAAAAAGAGAGGAGCAGGTGGGCCACTGCTCTCTGAGATGT 

TAACACCCCTCACACTCCACGGGCATGCTTTGTCATTCTGCACACCGGTGTAGCTGCAGC 

TCTGTGTGMTTCAGATCTCAAGAGMATGTAMTCMAAGTATGAGTTTCTTTCTTTCT 
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TGGGTGCCACAGTAGGAATGAAATGATGGGGACTTTTGGAAGCCCCTGGACTTGTGGCCC 
CTGTAGAAGAGCAGCTTGGGCAGGGTGTGATGGCCATCTCTGTCTCTAGGGGCCCTGTGG 
A 

Sequence 152 

TGGCGAGAGCGCTGGTGCCGAGTGAAAGATAACAAGCTCATTTTCCACAAGGACAGGACC 

GACCTGAAGACCCATATTGTGTCTATTCCGCTCCGTGGCTGCGAGGTGATCCCGGGTTTG 

GATTGTAAACATCCTCTGACGTTCCGGCTGCTGCGCAACGGCCAGGAGGTTGCAGTATTG 

GAGGCATCTTCTTCTGAAGACATGGGCAGGTGGATTGGGGATTTTACTCGCAGAGACGGG 

GATCGTCCACAGACCCGGAGGCTCTGCACTAXGACTACATTGATGTGGAAGATGTCTGCA 

ANGTGTCATTCAGACAGGCCAMCAGACCTTTCTGTTTTTATGAACAGGGCGTGTTATAT 

GTGCTAACCCATATCTAGGGGGCACCTTCAACGGGTTATTGCCCACCCAGCGGGACGGCA 

CTTTAATATGACGATGTTTCCGTNCCATAAACCGGNTTGNTTAAAGGGTAAAA 

Sequence 153 

GTTCGACCCACGGGCGTCCGCGGGACCGCCGTGGGNNNNACATACTATGCGNACAGGCGC 

GTTGNACACAAANGGCCCATTCCTGTAGCCTCACACTTGACTACACATGGGGGANTCACT 

CGGATTCGGNTCTCCACGTGGNNGNTCTTTGTTCTGTACTCTACGTAGCTTTGGCTTTTG 

TTTTCTCGTCGCAACAGGGCATGAGACTTCGTCGACCTTNGGGGTCTGTATAGTCTTTGA 

CTTACTACGTGTAGGTCTCAATACAAAGTGGGANATANTCATATCCGTCCGCGAAAAGTA 

ATTCTTGGAAAAATTTACCCTTGTCTCCCGCNTTATGAAACGTGAACTAAGTAACTCACT 

TTGCCGCTGGGGCGCCTCTNTTTAACANTGTTCTTTTGNCGAAATCATCATAACCTTCAA 

CTGAAAACAATGTGGTCAACAAACTGACTATGGAGGTCTTAGGCTCNGTCTCTAAGATCT 

TTAACCTTGTTTATCGGCGCGTGCGGCGTNGTCCGAACGAAGAGACTATAACCCGCACTA 

TAACNAAAACTCTTTTTTAATCCCACCACCTCGTGAGGGANGGGCCCTAAGACTGAAACT 

GTAGTAAGTCCTATTGATTTGCGTAGGAGGANTTAGGAAA 

Sequence 154 

NCGCGTCCGATAGTCTACCAGCCTTACCTGGTTGATTACACTTGTAAAAGAAAGATTAAA 

AGCAGGCCAGTGACTCTGGTCTGCTTGAACATGTGAATGTAGTGGTTTGAGCAATCTGGA 

GTTTGCCCTAGTGTCAAATTCCAGACTGTCCATAGTGTCCAAAACCTGAGGCAGACACTA 

ATGTTAACCCCCAGCACCCCGTGATTGGAAACAAACCTAAATACGTATTGGGAACTTAAT 

AGCAATTTTAAGCATTCTGATAGATTTTTTGTAGGGATGGGGTCATGCCATGTGGCCCAG 

GCTGGTCTGAAAACTCTGGCCTCAAGTGATCTCAAGCTTTGGCCTTCTAAAGTGTTGGGA 

TTACAAGGTGTGAGGCATTGCACCTGGCTTAGCCGTCTTGATTTGACATTGTAATGAAAA 

AGTGTGAGTCTTATTCTACCAGGGGCCC I I 1 1 1 I GTCCTCTTGAAAATNGAATAACCANG 

GGAAGGGGGAA 

Sequence 155 

CCNCGCGTCCGTCCATCACAGCCTCCGAAGGTGCTGGGATTACACGGCATAAGCCACTGT 
GCCCAGCCTGI I I I I AATAATGATATTAAGTGGGTTTGGTTCATGTGTTATTAATCAGTG 
TTAATAATCGTAC I I I I I I I I I I I 1 1 I I AAAAGAAACCATGGGTATTCTAAAATCAGGAG 
TCCAAATAAAAGAAAGTTCTCGGCTGTGCGTGGTGGCTAACACCTTGTAGTCCCGGCACT 
TTGGGG 
Sequence 156 

CGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGT 

AGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGG 

AGGGAGTCTGAGGGTTGGGGACCGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGG 

CGGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGT 

TGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCGAGGCCCGCGCTTGAGGGATCT 

GAAGAGGTTCCTAGAAGAGGGTGTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTT 

GGGGGTCGCCCTTCTGAGGAGGCTGCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCC 

AGMTCCCCTGTAGTTGATMTGTTGGGMTAMGCTCTGCACTTTCTTTTGGCATTTCA 

AGTTGTTAAAAACAAATAGGA 

Sequence 157 
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CGCGTCCGGGTGTCNAGGCCATGGGGCAGCCCTGGGCGGCTGGGAAGCACGGACGGGGCG 

CCCGCGCAGCTGCCTCTCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTG 

GCTTACGTGCTGGTGCTCGGTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTG 

CAAGCTGGCGCTGGCCGCCTTCCAGCTGCTCAACCTGCTGGGCAACGTGGGGCTCTTCCT 

GCGCTCGGATCCCAGCATCCGTGGCGTGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTG 

GGCTTACTGCTACCAATGCCAAAGCCAGGTGCCGCCACGCAGCGGACACTGCTCTGCCTG 

CCGCGTCTGCATCCTGCGTCGGGACCACCACTGCCGCCTGCTGGGCCGNTGCGTGGGCTT 

NGGCAACTACCGGCCCTTC 

Sequence 158 

CGAGCACGCGTCCGGGGACTCAGGCATGCACCACCACGCCCAGCTMTTTTTGTATTTTT 
AGTAGAGACAGGGTTTCTCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACATTCGGTGA 
TCCACCCGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCAGCC 
CCTTCCTGTTGAGTAAAAGGAAGAACTTCAGGGTAAGACACTGTACAGTGCCCAGCATCT 
GGAGAGCCGCCAGCATTACCCCTGCCTTAGGAGGTAGTCGTCTCCTCATCACTACAAGGT 
ATTGAAGCCTGAGGGCCCCTGGGCAGGACGATAGAGTGAGATTGCCCTGGGGACTCAGGA 
AAGGAAACATGCCGTATTTNTAGGGAAGGAGCTGCTGCTGCCTCTCAGTGACTCTGGTTC 
■ CAGGAGGGAAGAGCCGAGAGCTAGGGTTCCCTTTCATAGGGAGAAACCCAGCAGGGTTTG 
GGGTGTTCT 
Sequence 159 

ACCACGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGG 

AGGTAGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAA 

CGGGAGGGAGTCTGAGGGTTGGGGACGTCTGTGAGGGAGGGGAACAGCCCGCTCGAGCCT 

GGGGCGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGG 

CGTTGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCCGAGGCCCGCGCTTGAGGG 

ATCTGAAAGAGGTTTCCTAGAAAMGGGGTGTTTCCTCTTTTCGGGGGGTCCCTCACCAA 

GAAGMGGTTCTTTGGGGGGTCGCCCTTNTTGAGGGAGGCTTGCGGNTTAACAGGGCCAA 

AAAANTTGCCAT GGGAT GTCCAAGAATCCCCTGTAATTTGATTAAATGGTGGGGAAATAA 

AG CTTTG C AACTTTTTTTTG CN ATTT AATTTG GTT A AA A AC A 

Sequence 160 

TCCGCTCCCCTGTTTTCTTCCTTTTTC I I I I I GCTTGTATGCACAACGGTAGGACTTACT 

TCGTMGAMCAAMTGCCAGTATTTTCTTMGCCATGATGTGAAACCAATGACCCTGTG 

ACCACATGGCACAGAACACTAAATTTTGGTCCCATGGCTGAAACTTGAGGGTGACTAAAA 

GTAATGCCTGTGAMCATGATATCTATCTGGGATGGCCATTTGATCTCTAAAAGGAATTT 

TGTNCACTCCACAGMCTCCTATCTATAGTAAAATTTGATTTTTC 

Sequence 161 

CGTCCGGAAMATATTAAACAACTCATTTTMGATTCAAATTAACTAATTCCTGCATATA 

TGACATTCCTTACATMGCGAACACTAMCAMMTGGCTAGAAATGTCT1TTTCTTTCT 

TTTCTCTCTTTGTTGTTTAAGGTATTMGCACCGMTTATTACATGAGACTGGCAGATAG 

CT ATT AATCCTCTTACAGATTTGAGAMGTTGATTCTCAAAT ATTT ATGCACCTTCTCCT 

TCATTGTTTTCTTTAMTCTGTCCTCTTMMAGCTTCTTAAGAGCTCAGTTMTGCTTT 

TGACTTAACTAGGAGAAAAAGGCATGATAATACAGGCAAGATGGCATTGTTAGCAATTCT 

GGTAGGTGGTTTTGGAATGAATCCTAAGAGGCAAGGGGATCTTAAGGACAAGGAAGAGAA 

GAGAGAGGGGGNGGGATCCCTTTGATCTCTTTCTCTGGNAATCTTAMTGCNTMTTTTA 

CTAAAACATGTTCTCAATTCATTCATAT 

Sequence 162 

CCCCGCGTCC GGAT TAATGAGTGTATGCCTAGCTCTTTCTCCAGTTTACTTTTAGACCAT 

ATTGTTGTTTGTTTTGAATATCATTCCTTAGGCTATGTTGAGAGTAGAGTGGCTTCCCAT 

TAGGAGMCTMTTTAGGGCATGTCTTTTGCTGMTCCCGTCAGCATATTTAACAAATTC 

CCAATTCTAGATMTTTCCTTTTATTTCTCTAGTACCCTTTGCCAGGGGCTCTACACATC 

AAAGGTGTTCATGAAGTATTTGTCAAAGGAAAGAACAGTAATGACACCTAACACATAATG 

AGTGATTAGTATGTTCCAGGCATTGCGTGAGCTATTTACTGTGAGTGATTTMTGTTATC 
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TTCCCAGCAGACCTCTGAGGTAGGGTACTAGTATGATCCCCATTTCGTACATGAGGAAAC 
TGACACTAAGGGACATAAAATAAGTTTTTGAAGTCACAAAGTGAATAAAAGGAAGACCAG 
GGTTTTAATTGGAAGCCCATA 
Sequence 163 

TAGACTTTTGCAGTGTTAAACACAGCTTCCTTAACTCTTAGAACTGGAAGTTGTAGAGCT 

CTCCTTTTGGTGCCTTTCCAGCCTTTATACACACTATTGTAGCTTTCTTAGGTTTGATAG 

GTAGCGTTTCAAGTAGTTTAGCTGAGACAGNGAATGTATTAGGTTCAACATGACCTTGTG 

TT7TATTTGTGTTTGCCAACAGGATGCCTTATTTGTTTGAGAAAAAGATGTACTAGTGTC 

ATTCTAMCTATCTCCTTTTTTAGGATTCTAMGMGTTMTCATCATCCTTTTGTTTAT 

TTTACCACCATTTAGTGCCTTAAATCCTATCAAGAAAGCAGTGTTACTGCTCAATGCCCA 

MTAAGACACGCGGATATTGCTATTGCTTGCTTTTGAGTTAACAGGCCNACTTTTTATAC 

TTAAAACCTCA 

Sequence 164 

GCCCCGAAGTCCCACTGTCCCTGCCGAGGCGCCGGCGCCGTCCCCTGTGCCCTTGACCAC 

GCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGCGAGCGGTCGGC 

GTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCG 

GCCGGCCCGGGCCCGGGGCGAGATTGGGGGCCTGTCGCTGTCGCCATCGCCCGAGCGCAG 

GAAACCGCCGCCACCTGACCCCGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCA 

CAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCGCCGCCAGGCTCCGAGGCCGCGCC 

CAGCCCCAGCAAGAGGAAACGGACGCCCAGCGAACAAATCGGCGCATACGGTCGAACACN 

GGAAGCCCCCGGACCCGGACCCAACGCCGGGGCCCCCCG 

Sequence 165 

AGTCCGCCCCACGCGTCCGGTGAGTTTAGCGCTGCTGTCCGGATGGGTTGGTAGCAGACA 
GGGTGGAGTAGGGTTAAGCACACTGGTCACCTTAGGATTGGTTTCCTGGTGCTGGAGAAT 
GGTTAGGACACAGGCCTTGGAAGG I I I 1 1 I GAGTGTGAAATATTACTCAGCGTTTTCTGC 
AGACCTCGCGGGCAATGCCGCTTCTAATTTTATCCAGGCCTTCTTCTGTAGGGAGGGCCT 
GTTAAGAGTTGAGCAGCCCGATTTCTGAACCCCTCTAAAAAGCTGTGGCTGATTGGTGGC 
1 1 1 I I I I 1 1 ICTTGGAGAGGGGGGTGTCAAAGATTTCTTTAAAATCGTTAGTGATGTGGT 
CTCGCTTTA 
Sequence 166 

ACGCGTCCNAAAATGTGTGGTACATGGAATAI 1 1 I 1 1 ATTATGCTTATTTCGTATTGCCA 

GGTAGATGGCCAGCCTGACATTCAAAATTATTTTATCAGCCCCTAAATGTTAATATTTCC 

CMACTATTTAAATCAGTAGAAGACATTTTTACTATTAAGAATAAAAAGTTATAATATAA 

MTGGATTAAATGCCAGATTATATGCTAAACAAGTCCTTTAAATTTTTAACTTAATATTT 

TTAACAGA I 1 1 1 1 1 1 1 1 I GAGATGCAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATG 

GCACAATCTTGGCTCACTGCAACCACCACCTCCCGGGTTCATGCGATTCTCCTGC 

Sequence 167 

CCGTCCGCGAGGTTAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTA 

CTAAAAATACAAAAAAAATTAGCCAGACATGGTGGCAGCCTCCTGTAGTCCCAGCTACTC 

GGGAGGCTGAGGCAGGAGAAGGGCATGAACCTGGCAGGCGGAGCTTGCAGTGAGCCCAAG 

ATGGCGTCACTGCACTCCAGCCTGAGGGACAGAGCAAGACTCTGTCTCAAAAAAAAAAGA 

AAAAAAAGTGGCACAGATCAATTATAAATCACTGCTTCAAGGCCAGTGCTCTCACTTTGT 

ACATTAATAATCTCAGGCCCAAATAAGATAAGTGATATGTCAACGTATGTTCACTTTGGT 

CTTTACATGGCAGCTATAGTATACCGGAATATTATAAGCTCAGATCGTCATAGCTACATA 

ACTCCTTTAGTTGGGAAGANACGCCGTAAATGCCCATCAAGANTAGCAAGTCTTGCATTT 

GACT 

Sequence 168 

CGCGTCCNGGTAACTTGAATAAGGATTATGTGCCCCACCCTTACTCTCATTCCTGCTTCC 
TCTTGGGCTCAAACAGGGTATGAGTATGAAGATTTTGCCTTTAGTTCCTGAACTGMCCT 
GCT TGCTA TCCCTTTCCTCCCCACCACTACCTTATTCCTTCTCTGCCTCCAAATTGCCAC 
TTTGTTTTGAGGCTTCCTTCCCTACCTTATTATTCTGAAGGAAGTAGAGATCTTGCTTCT 
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GAAACCCCTCCTAAGAAACTGCCCAGGGACAAGATAAATTACAAACAATTCATGGGAGTT 
TACTACCTAAGTTGCTTCTAGGGCATATGTATACCATACTAGTAGTCTAGATTTCTGG 
Sequence 169 

AGGGAGTCGCCCCCGCGTCCGGACAGGATCTATGGATGGAAGAAGATGAAAACAAACACA 

CAGCCATAAAGGTMTTGTTTCTGGCCMCATCTTTCTACTGATGCTTTGTTTTGATTGT 

ATGTTGCTGTTTATATTTTCTCAAACTTGAGGCTCTATTTTATGAAATGTTGAATATAAA 

TACATTGTATTTAACTTGAAAAATTCCTGGAAATATACCTGATAATTACCACCTGAGGAA 

TCGNTTTATTTTATGAAAGTAACAGCGTGATGAATACTGTAATTACAAAAGAAAATTAGT 

ACTCACTGACTTATACCCTTG I I 1 1 ) 1 1 I 1 1 1 1 1 1 ) I GGTTTGG 1 1 1 I I 1 1 ) GGTTTTGN 

TTTAGGNGACA 

Sequence 170 

GGAGTCGNCCCACGCGTCCGGCTGGCCAGCAGACCCTGCAGCAGTCATTGGTGGTTCTGT 

CTTTCCCTAACCCTTATGNGCTGGAGGACAGAGGAGCCACCTCTTCTCACTGGTTGATAC 

TCTGTTCCTAAGACCTCAGGTTACCCAGTGTAGCCTGCTGTGACGCTTCTGCTTCATTTT 

CTGCCAAGTGATACAGAACCTTCGTTCTGCTGCTTTGGGGACAGGTGATCCTAGCCCCAG 

CTTAGGGCCAGTGCTCTGTGCCACTCTGGAGAAATAGGGAAAAGATAGGGGTGGCTCAGT 

ACAGCAGCCCTGTGAAAGTCAAGGCCAGAGCTTTTC I I I 1 1 IAAI I I I I I I ATTATTATT 

ATTATTATTT 

Sequence 1 71 

TTTAGGGAGCCGACCCACGCGTCCGCTTGGCAAACCTCCGGGGACTGTCAGAGGAGGAGA 

GGAGCGAGAAGGCTATGCTTCGCTCCCGCATTGAAGAGCAGTCCCAGCTCATCTGCATCC 

TGAAGCGGAGGTCAGATGAGGCCCTGGAGCGCTGCCAGATCCTAGAGCTGCTCAATGCAG 

AGCTGGAGGAGAAGATGATGCAGGAGGCTGAGAAGCTCAAGGCCCAGGGTGAGTACAGTC 

GGAAACTAGAGGAACGCTTTATGACCCTAGCAGCCAACCACGAGTTGATGCTCCGCTTCA 

AGGATGAATACAAGAGTGAGAACATCAAGCTGAGGGAGGAGAATGAGAAGCTGAGGCTGG 

AGAATAGCAGCCTCT 

Sequence 172 

CCACGCGTCCGCTTAGCCGCTGCCAGAGTTCCATATGTTCTGGAACCCTTGACTCCTAGA 

GTTCAGAACCCAGCCAACTTGCAGTTTTCAGAATGTTCAAGAAACTTCTGACACTCAGAG 

TTGCAGAACCTCCTGGTCCCTGCAGATTCCTGGAAATCAGAATATGGTGGTTGNAAAGAA 

TCTTGTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCG 

GGCAGATCGCCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAATCCCGTC 

TCTACTGAAGATAACAAAAATTAGCCGGTCATGGTGGCGCCCGTGCCTGTAATCCCAGCT 

CGGCAGGGCCGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAAGAGGTTTGCAAGTGAGC 

CNAAAGATCGAGCCACTGCACTCCACCTTGGGTGACCAGAGTCTTAAAAAAAAAAAAAAA 

AAGG 

Sequence 173 

CGTNCGGTGAATATTCAGTMGCTMTTAAAATTTGTAAGTAAGATTTCCTTATTAAAAA 
AACTATTAAAATAAGANGAAGCCCGTTNAAATAATNATTAGAGNNGGAAAAAAGAAGATG 
ACTATCNMTTACAGCACTTTTTTTCAGCTATACATAAAGGCCTTTTCANT 
Sequence 174 

CGTCCGGTGACCCATTAAGTATATTTCGTGACCCANAGTTTGAAAGAGATTATAGGTGTA 

GCTTTGCTAGTTTGTAATTGATATAGAACAGTGACTATCAGGGAAGNTGAAGAACGGCNA 

ATTGAATGTAAATCATGTCTGGATGGTGAAGATTCTAAGAATGCANCTAGGGAAAGGGCT 

GCAAAAAGAAGGTGGCAGACTAATGTAGAATGGTGCAACCAGATGAAGACATGGGTGGCT 

TTAGGAATTCCAAAGTGGCCGTGAAGGCCAGGCACGGTGGCCCACGCCTGTAATNCTAGC 

ACTTTGGGAGGCCAAGGTGGGTGGATTGCTGAGCTCGGGAGTTCGAGACCAGCCTGACCA 

ACCAGGTGAAACACCATCCCTACAAAACAT 

Sequence 175 

GGCGGCGGCGGCGGCGGCGGCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGTNGCGGG 
TGCGGGAGATGCCGTGCGGGACTGGGGCCACNTTGAGCCGCCCGNCTCGTCCCCGCCTTC 
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TGTGGGAAGGATGTGCGCGCGGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTA 
CGGCCCCTGGCTCTGCCTCCTGGTGGCCCTCGCCCTGGACGTCGTGAGAGTGGACTGTGG 
CCAGGCTCCCCTGGACCCTGTC 
Sequence 176 

GAGCTGGCTGGTGTTTGAGCTGTGGCAGAAGCACCTGGGGCTCCAGGGAAGCANGCTGGG 

AACTGCAGGACCTTGCTCAGCCAGGAGCACTTCCCCCTCCTTGAGGCAGGAATACTGAGG 

TGCCTCCCCACAGATGGAGAANGTGGAGAGGAGGATGGGCCTCAGGAGCATCTCAAGCCC 

CAGTAGCAGGANAAAGAAAGMAGAGATGCCTGGTTTTCACAGACTGGTTCCTGTGGCTG 

GGATGACTGCATCC I I 1 1 I I I I I I I I I I I I GAGACGGAGTTTTTGCCTTTGTCGCCCAGG 

CTGGAGTGCAATGGGGTGATCTCGGATCACCGGAACCTCCGCCTCCNGGATTCAAGCAAT 

TNTCCCGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCTCCACCGTNCGGCT 

AATTTTGTA 

Sequence 177 

CCTTGTNAGGGGACACAAAGAAAAATTGAATAAACTGTATGATTTAAAAGATTATCGGGA 

GAGTTACCTCCCGATATAAAAGGAAGGATTTACAGAATGTGACCTAAGGTCTGGCGTAAA 

TGTGCACCGGAACCGAGAAGGCCCGGATTGTCATGGACGATGAGATACACCGGAATATCA 

TGGACATATTCTTTAAAGCGCCCTTTATCTTCAAATGCGGCACGGAAACCGGAGGCTTTG 

AAGAACTCAAGGAAGCGCGGCACGATACCGCCCGCAATAAACACGCCGCCAAATGTCCCG 

AGATTGAGCGCCAGATTGCCGCCAAAACGGCCCATAATGACGCAAAACAGCGACAATGCG 

CGGCGGCAATCGGTGCAAGCTGTCAGCCAGCGCCGCGTTCGGTAATATCTT 

Sequence 178 

CACGCGTCCGACCGGAAATGCTGACCTGACCTTTGACCAAGTACGNGCGGTGGGGGGGGG 

GGACAAGTGGGGTGGTGGTATTAACTGGCTCCGGTGGGTCTTCAAGCCCCAGGAACCCTC 

CAAGGGGGAAACAAATGGAGGGCCCTAACGCAAGAAGCTTCAATCGGTCCCTTGGACTGG 

GCTTCTTTCGGTGGGTTGGTGGGTATGGCCCTGGGCTGGCCTTTNCCTNMTCTTTCCTT 

CCCTNCCTGGGGGCAATCTGGCTGGGTGGCCAAGTGGCTTGCCCCGGCAACAACTTTGGC 

TGGCTGGCTTACGGTCAANGGGTGGCCCCTGGCTTGCCCCAAGAACAAAGGTGGCNTTGC 

TTTGCCCCCGGAAGGGCCCCTNGTAATTGCCCGGCCCCGGGCCAAAAAAGCCAAGCCACC 

TTCMGGGTGGTTTCCCAAGCATTTTTMTTGCCCCCCCMGCAACCCTTAATTGCCCAA 

CCCTT GTTCTTTCCCCGGCCCAAAAGAACCCCCNAACCCCCCCAACCCAAGCTTATTGGA 

ATTTTCCCMATGGGGGCCCCCCTTGGCCCTTACAAMCCGGGGGTTTACCCCTTTGGGG 

AGGGGAATTAACCCCCTGGGGMGGMACCGGTTTTGGACCANGGGGMGGTAAGGCTTT 

MANCCTTGGGGTNGGGGCCCAAAAGGGCTTCCCCTAAATTGTTAACCCCCCTTGGCNTT 

TT 

Sequence 179 

CGTCCGGAMACTGTTCATCTACTCACTGTAGTGCCCTCCTTGAMTGTGTGTTTGTTCA 

TTCMCTMCAATATTTGGGGATCCCTGTAGTAAACACTGTATGMTTTTACACAGTCTG 

GCCATCAAGAAAGATCACGGAGTATATTCTAGATGGGGAGGCTACTAAGTGAATAGGAAT 

CACCACGCTGGGCTGTTTATTAGGTACAGTAATAAACATAAGTACTGGTTGCAAAANAAA 

ANAANAAAANAAAAAA 

Sequence 180 

CCNCCGCGTCCGAAAAGACAAGACAGCATACTGTATTTTTCCTCTTAAAATTCAATGTTA 

CAATTMATGATTGTT^CTGAGAATAAGTTAGCTTCAGCTTTCTAATCGATGTGTTCCC 

ACATCTACAAATTGATATGAAMATTATTTTGAAATGCACACTGCAAAATGGTGAGAATA 

TGAMGTTACCTGGGAATTAAATCAGAACTGTCTCCATATGACTATTTCCAAGTCACAAT 

CATAACTTTCTTMTAGCAATGGTTATATATGTGGCCAGATAGTATTCAGTTTCACAGTA 

ATGTCTCGGTCACATAAAGATAGCANAGCATAGACATAGTACAACMTTTATTATTTCTG 

CTGATTGCCAMTGTGCATMAACTATMAGATATATTTTCCAGCCCAGGTGACAGAGAC 

CCTGTCTCCCNTTTNAAAANCTTCATGNTAAAGGTGCGGCCGCTAGACTAG 

Sequence 181 

CGCGTCCGCTAATCAACTTTTAAAAATAATGTTTTACGGCCGGGCGCGGTGGCTCACGCC 
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TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGGAGATCGAGA 

CCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCCGGC 

GTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTG AI I I 1 1 I AI I I I I IGAACT 

TTTTTACAGAGAAGGGGGGTCTCCCTATGTTGCCCAGGTTGGTCTCAAACGTTTGGGCTC 

AAGTGATCCTTCTCACCAAGAGTGCTGGGGATTATAGGCATGAGCTGCTGTGACCGGCCC 

AAAATCAAATTTCAAAACTAAAAAAATTCTCAGATAATTAATGAACCTACGTGAATTAAT 

CTACAAATTCCAGTTTGAAAAACACTAAAGATAAACAACTGTGAATGTGGGAAACTTAAA 

AAGTANGTTGGTATTTAGGTTATtGGTTAAAATGGGGGACCGACTGGCATACACAGTCCT 

AAATATTTAAGTCTTAAG 

Sequence 182 

CCCTATCTTNCGGTACTGGTGGGCCAAATTCCTGGGACCAGGTCAAGGTGGGCTGCCTCA 

GTAAGAAGGAAGGACTGGAGAGTGCCCATTTCAGAGGAGCAGGCTGGTGGGGGCCAGCCA 

GAAAGTAGTTCCCTTTGGGGGGAAGATGTTGGACCTTTATTATTTGTGGTAACCAGCCGA 

GGCTGGTTGTCAGGACAGCAGGTGAGCCACTTTAGGGAAGAAAGTGCAGGGGTGGGTGGA 

TGCCCAGATTACCAAGGCCAGCCACCCTGATGGGGTAGGGTCTGGTTATCTGTGTTCAAG 

A AGCA AATCCCACCCCAGCCCCAGCACTAGCTCTCTATGTATGTATTTTCCCTGTACAAT 

GTTTTATAAAAGAGATCATTAATTTAAAAAAAAAAANAAAAANA 

Sequence 183 

TCGTCCATTTACCTCACTTATGGGGTAAAAGGTCACTTCAAGTAAGGTTAAAGGTTTTCC 

CTGGCAAAGGACCTAACCAGAGCCCCNAAGGGGGGGAAAAAAGGAGGTCACCTTTGGGGA 

GGTACCATGGCCCNTTTTGGTCTGGCCCNTTGGGCNTCTTCAACAACAAAGGAATATTTT 

ACCAGGCCCTTTTGGAGGCCTTTGGATMTTTCTTAAGAAATTTGGTTACCAGGAAGAAT 

TTAGGCTTCNTNGGNAAAAAGGAAAAATTAGGACCTAGGAAAGGGAATTAAAGGGGGNAA 

GGGGAAATCAATTAAGCNTTAATGGAAAGGGGGTTTTACTTCTGGCAATCAAGAACCGGC 

TTTTTCm'AAGTTTCNTAATGGAACTTTAAACCGGTNCCTAATANGGGCTNGTAAANGGG 

GTTGTCNrrGGCGGTGGNAAACCACCTTTCTTTTTCCTNGGGCCCTTCCCTTTTCTGGNCC 

CGCAATTTNCCCTTCTTTNAAACCCTTCAAAGTTTGCCCTTGGAGGG I I I I I AATTAATT 

CCCCCTTGGTGGCCANTTCCCTTGGGGGGCCMTTGGGTTTCAATTTCCANCAATTANTG 

GNAAMACCMANTCCAAGGGGAAGGGACCCCCTTNGGGCTTMNTTTTTCCTTTTAAAT • 

CTTCCTGGGAATTTTNGGATNGGGGGAAMAAAAATAMTTAAMTTCCTTTTTGGGGCG 

CCCTTGGCCMGTNGGGGGGGMAMTMTTTNGGNTTTTNGGGGGGGGGGNTTTGGGGA 

ATANCCTTACCAAGGMCCCCTTCCTTGGATNCCTTTTGGGGGGTCTTTTCAAAAAAT 

Sequence 184 

GCGTCCGGTTGTAGTTTCCCTTCCCATTCTCTTGGTGGCCCTGGAAGCTTCTAGGCACAA 

GTGTGCCACCCTGATTATTCCNACCACTCCATCCAACTTTTCTCTCTCTGTGGGTGTCTG 

CACCACAAGCTGCCTACCCTCCAGGTGCCTCAATGGTCCGGCCACCAGTTGTGCCTCGGC 

GCCCCCCGGCCCACATCAGCAAGTGTCAGGCAGGCCTCCACCCAGGTGCCACGCACGGTG 

CCTCATACCCAGAGAGTAGCCAACATTGGTACTCAGACCACAGGACCCAGTGGGGTAGGA 

TGCTGTACACCAGGCCGGCCGCTCCTGCCGTGCAAATGTTCCTCAAGCAGCACATAAGCA 

CCTATCGGGTCCAGGAGCCGGCTTGTGCACATNCCAGGACAGGAGCCCCTGACCGCGTCC 

ATGCTGGCTGCGGCGCCCCTGCATGAGCAAAAGCAGATGATTGGGGAGCGTCTCTACCCC 

CTTATCCATGATGTCCACA 

Sequence 185 

GTCGCCCCGCGTCCGGGCATTTGTATTTTCAACAATTGTTCTCAAATTTAGAAAAGAGAC 

ATCGCAAGATGGTGAAATAGGAAGCCCTGGGCCCTCCTTCCCTCCACAAACACACTGATT 

TGACAACAGTTCATGGACAGATTCCCTTTATAAGAAACCAAGAAACTGTTAAGAGGCTCT 

TATACCCCAGGTGAGTGCAAAATCATCCACATCAAAGATAGCTGGGAAGTTCAAGACACC 

TTCTTTCCGTMTTTCTAAACTGGCACAGTACTATATGATTGAGATGAATCTCCCCACAT 

CCAAGCTTCCTGCTGGGGAGGAGAGGGGAGGGTATACCATTATGTCCAATGTTCCAACTC 

CTCTAGGAGCTACCCAGGTAGGAGGTGGGTCCAGCTCTGTCAGGCTTGTCTTAAGAGCAC 

TGATTGAGGGTCTGGTATTCTCTAAGTGGCCCAGGACCATAAGAGCAGTGGATGGTGCTG 
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GGGNTNNGNNGTGGGTTACCCATAACCCCCTGGTTTTTGG 
Sequence 186 

TCCGAAGCAGTGAAAATAGTGTTTGACTCCATTGACAACCTAGAGGCGGCTCCTCATGAT 

ATCGGCTACGTCAAACAGGCCATGTTCCACTATTTCCAGGTGCCAGATCGGCTAGGGATA 

CTCACTCACCTGTATAGGGACTTTGATAAATGCACGTTTGCTGGGTTTTGCCGGAAAATT 

GCAGAATGTGCTCANCAGGGAGACCCCCTTTCCCGCTATATCTTCAGGAAGGCTGGGGAG 

ATGCTGGGCAGACACATCGTAGCAGTGTTGCCCGAGATTGACCCGGTCTTGTTCCAGGGC 

AAGATTG GACTC CCCATCCTGTGCGTGGGCTCTGTGTGGGAAGAGCTGGGAGCTGCTGAA 

GGAAGGGTTTTCTTCTGGCGCTGACCCAGGGCAGAGAGATCCAGGCTCAAAACTTCTTCT 

CCAGCTTCACCCTGATGAAGCTGAGGCACTCCTCCGCTCTGGGTGGGGCCAGGCCTAAGG 

GGCCAG 

Sequence 187 

CGTCCGCGTCGCTCCCCTGCTCGGGGTCAGCTAGTGTCCGCTCTGCTCGGCCGCGGGCTC 

CCGGAGGACTGCAGGCAGGATGACCGCAAAAACACGGGTGATTGGTGAATGGAAGTTGCT 

ATGGGCCTCTAAGGGCCATCCCAAGCCCAACCCAACGTTAAACGGTCCACAATCCACCAA 

GGA AGTCAA GCTTTTGACACAAACTGCTGAAAAGCTGGGAGGGTTTCTTCTGAAAGAAAG 

TTTCTTTTTTCMCCCTGGGGGACACTGGTGCCCTTTCCACAAGCCAGGGAATTGGGTTT 

ATGAAGCAAGCTTGGCTCTAAGGGGGGTGACCTCAAGATATTTGCTGGGGGGTTGTGAGG 

TTTGGTGGTTCTTGGGAAGTGTGTCTCAAGCTTTGGGGGCCCTGGAACTGTGCTTGAAGT 

GCCCTCAGGCCTGTGCCCTTCTGGGGCCGGGGGTCTTGTGGGTGNATNCGCAAGCANGGA 

AGCCTGGGGGCCATTGGTCCATTNAAGAAGGCACCCCGGGGCCAAACCTTGCTTGGCTAT 

ANTATTCCAAGCCTGCTTCAACCCNTGGGCAGGCTTT 

Sequence 188 

TCGACCNCGCGTCCGGCTTCGACGCCTTCCCTAACATCGAGAAGGTGTCCAAGATCACGT 

CTCCCGTGCTCATCATCCACGGCACGGAGGACGAGGTGATCGACTTCTCGCACGGGCTGG 

CGCTCTACGAGCGCTGCCCCAAGGCGGTGGAGCCGCTGTGGGTGGAGGGCGCCGGGCACA 

ACGACATCGAGCTCTACAGCCAGTACCTGGAGCGCCTGCGTCGCTTCATCTCCCAGGAGC 

TGCCCAGCCAGCGCGCCTAGCGGCGGCCCCAACCGGCCGGACCTCAGCAATAAGGCGGCC 

CCCGGACCTCACCCCGCGCCGGCCCCCACCCAGGGGCTGCATGTGGACCCCCCGGGCGGC 

CCAGGGGACCCCGCCCCGACCCAGGGGCTGTGGACGATGTACAGGCAACAGAGCTACCGC 

ACTCCTTTCCTTTTGGAAGCAAGAAGAAAATACGTGAAAACGGGAAATTAAAGATTTAAA 

AATTTTTTNNNNNTNNANAAAAANNAAAGTGCGGC 

Sequence 189 

CGCGTCCGAAGCCTTTTGTCTCAGAGAATTTATTGTCCTGACAGCAGAGGCCGATGGTGG 

GATCGACTGGCCCTTAATTTACACCAGCACTTGAAGCCGCCTGGAACCCGACTATCAAGT 

GCATCACAGAGGGGCTGGCGGATCCCGGAAGTCAGAACGGGACACCGNCTTTCACTGTAT 

CAGCGAGCCGTGCGCCTGCGAGAGTCTCCGAGCTGTAAAAAGTTCAAGCACCTCTTCCAG 

CAGCTCCCAGAAATGGGCTGTGCAAGATGTGAAACACGTGACCATCACAGGCAGGCTGTG 

CCCACAGCGTGGGGATGTGCAAGTCTGTGTTTGTGATGGAGGCCCGGGGGAGGCCCGCTG 

AC CCCA CCACGGTCCTGTGCTCTGTGGAGGAGCTGGCACTGGCCCATTACAGACGCAAGC 

GGTTTTGACCAGGGGATTCATGGCGAAAGGGTCCACCTTCAGCACCCTTGTATGGCCTTC 

TTNCTGTGGGGACATCATCTTCATNGGATGGGATTCCCGGATGTCTTCAGAAACGCCTGT 

CANGCATTNCCCCTGG 

Sequence 190 

CNACACATGCGGAATCATAGGCCTGCAAAGCTCCTGCTATTCACTATAACTCTGCCATGC 

CTTAGGCACTTCCTAACCTAGAATTCTGAGTGAAGGACAACAATAACTAATACTTTTGAT 

TCAGGTATTACAAAGAAGTTAAGAGTTCATAAGGCACCTAAGTAAAGTCACATTGGTTAA 

GAGTACATGTCTCCAGATACTCTTACATTTGCAAAGNAATTGCATTTCTGNATCTATGGT 

CTGTAMTMAATTGMGAGTTGNGAGAATAAAAGCATGTTGTCTTTGATAAATTGTTTT 

TACMAACAGGCACAAGAGAGGCTTGMGGGTCCTTGCTATCTTTTAACCTATTTTATAA 

TCTTTGCTGCATAAGAAACAAATATGCTTATTTACATTCTATACTTAAACATATTATCAA 
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ACTTTTATTCTGAAGATAATACACATGMGGCTTTAAACTTTAATCTTCAATATATTTTT 

TAATCTCCCTGATGAATTAGGGAAATAAATATTGGG 

Sequence 191 

TCTCCCCCTTCACCCTCCI I I I I I 1 1 I I I 1 1 1 1 IG I I 1 1 I I CTTAAAAGCATTTAGCTGG 

GTGCAGTGACACATGCTTGTAATCCCAGCTATTGAGGAGGCTGAGGTAGGAGGATTGCTT 

GAGTCCAGGAATTTGAG GCCA GCCTGGGGAACATAGCAAGAGCCTATCTTAAAAAAAAAA 

AAMAAGCATTTCAATTATTTTAAATTTTNTMTTACAAAACAATTCTTTCCTGTCTTAG 

TTTAGTTTAC I I I I I ACTCACAAAGTTCCTGGAAAGTAGTTATTACTCAATAATTGAATG 

CATGAGTGTCAACTGCAAAATCTATGCATTATGTAGNGATTTGAATCAATTAGTCTTTNT 

TGATACTCCAAAATTACCCTTTTTCGAGNGTCTTTATCANAAATTTGATAAATCGGAACT 

T 

Sequence 192 

TCCGCCCCGTGAGGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATG 

GTTGAGTCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGT 

GATGATGCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCA 

ACTGCATGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATGGTTGAG 

TCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGTGATGAT 

GC TGGG ATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCAACCGCA 

TCATTTTGGGGAAGACACAATTTCTCAGMTTTTATTTAAGTTGTAAAAATAA 

Sequence 193 

ACACATTGCGGCACCGGGCTGGGCCTGGCCATCGTCAAGCATGTACTGCTGCGCCACCGC 

GCGCGCTTGGAAATCAGCATGTGTGCTGGGCCATGGCAGTACGTTCACCTGCCATTTTCC 

GCCAGCTCAGGTGACGCGCACACGGCTGGTCGGGAATGATGAATAACCTGAAGTGACGAC 

GACCCAATGTGGGAGCTGCTGATACTTTGCTGTCTCCGCCCAGCAGTGGGCACTAGGCAA 

GGGCCCCATCAGCCGCTACATTGGCCGACTTGCGCCTGCCTTTCAGGCCCTCTCGTCACC 

CCTTATTGAACACACGGAACCTGCAAAACCCCATCATGGACCCTTCCCCTGGTATTACCC 

TCGCTACACTCTTCGCCGATTTCGGCATGATTCTTTTTGCACTGATCCTGGTACTGCTCA 

ACGGTTTCTTCGTTGCGGCGGAATTTGCCATGGTCAAACTGCGCTCCACCCGGGTCGAGG 

CCATTGCCCACACCAACGGGCTGGCGCGGGCAGATCCTGCGCACCGTACACAGCCAGCTC 

GACGCCTACCTGTCGGGCTGGCAGCTGGGGTATTACCCTCGCCTTCCTGGGTTTGGGTTG 

Sequence 194 

TGAMTCTCCTAATACACTGNGTTTTTATTGTTATGTATTCTATGTTTTAAAGCTCCTCA 

AGGTAT TGTTATTCTTTTGTATAATCAGTGTTTGTTTGGAGCTGTCTGTAAATTTTCCTC 

1 1 1 I I IGCTCTTTATTTTTTCTTTCAACTCAGACCTTCCAAAATGGGATTATTTTTCTTT 

TTTCTTTAGAGACATTTGCTTGATMCAGAATTTTMTTTGGCGGCTATTTATTTTTTTC 

AGCACACTGCAGGTATTAMCTACTTGGCTTCATTGTTGGTTTTGAAAAATCAGCTGTTC 

TMTGTTGCTCTTTGAATATMTCTGCCTTTTTCTGTACTTGCTTGNCTTTMGTTTTTT 

GTT 

Sequence 195 

CGCTCCCTGGTTTCTTGTCTCATGAANAAGAAAAAATCCTAACTGTTCTTGATGATCTTT 

AAGGCTCANAATGATCTGGACAGAGGTATTTACCTTGAAGCTCATAAAGCATAGGCCTTT 

CTCACTTGCAGAGTATCTTTCTGAAGCTGGACTAAATTGGTTAAGGCCACTGTACTTTTC 

CACTTTGTCTTCTTTCCTGTCACACACCCTATCCTTTCAGGCTGTGTTGGGATGGACAAA 

AGCAGTTCTGGAGTCCTAAGGAGAAGATGAGTGGGATATGTTTCTGTGACCTGCAGTCAT 

TTTAAAGTTTAGCTGTTGCTAGCTGACTCCATGTAAGMTACCTTCCAGGAATTTGATGG 

CTGTGCACTCTGGCAGTGCAACTGGCATGGT 

Sequence 196 

CNCGCGTCCGGCCCTGATTGATGAAGCACAGTCAGTAAATCATCTCTTCATTCCCCAGTT 
CTTAAGCCAACATCAGCAACACTGAGAGAACATTAGATTAAAGGCAGGTATAGAAGAGAG 
ACTTAGGGTAACAAGTTAGTGGGTGCCTGAAGGCATGTGGGAAGAGATGTGGTAAAGGTG 
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TGCACACTATTCCTTACGTGACTTAAAATGTCAAGGGGCTTGGTGTGGTGGCTCATGACT 
GTAAAACTAGAGCACTTTGGAAGGCCAAAGCTGGAGGATAGCTTGAGGTCAGAAGTTCAA 
GGCCAACTTGGGCAATATGGCAAGACTGTGTCTCTACCAAAAAAAAAAAAAAAAAAA 
Sequence 197 

CCGTCCGGAGAGGCTGTTNTTGATNGATACTCCAGAATAGACACGGGCCCTTGAAAAGCT 

TGTACCTTTGGGGAAAAGTAGACCAGGAAAATCTCAAGCCAGAGMGGACTATGAAGCTT 

CTTTTTCCAGTTACACATGGAAATAAGAGGTCTTGCATGAGAAACCTAAATCAACCCCTT 

TNThrrTTAAGCAGGTTTTGGAGTCTAMTTTATACTAATTGCATTGTGGAGCAGGTCTAG 

CCAAGAAATTAACATGACAATCAGTCTCTAATCTGTCTGTCTATCTACTTACTTACCTAC 

GAGTCAGGCCAAAGAGATTCCCACAAGTAGGTTAAAAAAAGTAAAGTCTGATAACAAATA 

ATTAGTAAGAATGAATGAATAAATAAATAGCACAAAGGAACACTCTACCACAAGGAGGAG 

TACTCATCACATCAAATGAGAGAATACCCTAAGAAAGTGAGATAATGCCAGCAATCTGAA 

AGATATTTTACCNAAACGTGTTTAAAATGNTTAGGTTATTAAAAAAAAAAAAA 

Sequence 198 

CTAAGCTGTTATTTTCCCTAAAMTGCTTCCCTTGCATTTATACTTATTAAGAATAGATA 

ATAGCTAACATCTATCCACTGCCTTTGATTCATGAGGCCCTCACGAATTGCCTCATTAGG 

TCTCCGACAGTATGGTACAACTACTCTTTATATTTTACAGATGAAGAAACTGAGGCTGGA 

CTGNTACCTTTTTGCTACACATCCTAATG 

Sequence 199 

MCCTGTTTTGTTAGATGTGAATCTAGGAAATACAATATATTTTTAATGTAAAAGNACTC 

TTGCTTTACTTGTAAACTGATTTTCG 1 1 1 1 1 1 I CCCCTCAGGCCATCAAGCCCTGTCGTC 

CTATGACCAACAATGCTGGCAGACTTTTCCACTACCGGATCACAGTCTCCCCGCCTACGA 

ACTTTTTAACTGACAGGCCAACTGTTATAGAATACGATGATCACGAAGTATATCTTTGAA 

GGATTTTCTATGTTTGCACATGCCCCCCTGACCAATATTCCACTGTGTAAAGTAATTAGA 

TTCAACATAGACTACACCGATTCATT 

Sequence 200 

AATGCAAGAACATCTGGATNAAATGGCTTTCTAGATGAGAATGGTTGC AI 1 1 1 1 IA ATGG 

CTATTCTGGTAGAAAGGACAACATGTGATATTCATCGACCCTCTTTCTCAGACCCTCTTT 

ATMCAGCTAGTATGGAAAAATCTGTCTTTCCTATAAATATTTTCCCTGGGGAATGGNTT 

CCCATTAATGTGTNGGATGCTTTGNTGTTTCTCTCTTGCACAGCGTGTATGTTGTCATCT 

GCACTTMGAAGTGGAATGGAATTGGGAATCCTGGTTTCTTGCTCTGGCTGGAGGTCCCA 

AGTCCGCATTGTGATTTTGGGAAGTCACCGTTGTTCTTCTAGATCTAATTTAACTCATCT 

GTAAAANMNNNGGNTNGAATTCCGACMTTTCATGGGTTTCATTAGATGCCAATATTTT 

ATGACTCATGATCCCAGCNAGACTACACCTATTTATAAGGCTGGTTTTGCTTG I I 1 1 1 AC 

TAAGAGCAACAATNACTACATATTTTCAGGTTACTCAATCATCAAAAAAATTATAAAATC 

CATAAACACTTTGGATTTGAAACATTGCAACTTTG 

Sequence 201 

CGTCCGAAAAGCAGTCTTTCTTGCTCAAAGTATTAANGGTGAACAATTGAATAGAGTACT 

GTGGTCGGGAGACTGATTTGAGACTGCAGAGCTGATGCTGGGTAGAGGGTCTGGACTTGT 

ATTCATGTTCTGTCTCAGGGCAGCCCCTGGAGCAGGAGATGGCAGAGGCATTTACAGCTG 

CAGAAAACAGGGAGGAATGGAATCTGAGGTAGCCCTGGCCTCAAAATTCAGGCCTGGCTG 

TATCATTTACAGAGATTTTTCTGGAGGGAAAAAGTCTCATTTCTGAGGAAGGCAAGGNGG 

GCTAATCATTATTAA I 1 1 I I I I I AAACTTTTTG 

Sequence 202 

GCGTCCGGTTGAAGAGGGCAGGGAATAGGGGTGGGTGAGCGTGAACAGAGTCAGGCTGAT 

TGCTGCAGGGTCCCTTGCATTAGTTCAGGTGAGAAGAGACCCGAGTAGGCCAGTGAGCCT 

GGAGGAGAGGCTCTCTGTGTGTTTAATTGGTTTCCAGCI I I 1 1 I I CTCTATTCATGTAGG 

TTATACACGTTTCTTCTGTGAATTTTTATTTAAATGA 1 1 I I I I GTTGTTACTGGATCTAC 

AAACAGCCCAACTCCAAGGAATCTGGCATCTCTCAGTGGAGCATACAGGTGACTTCATAA 

TCTAACCGCATTAGTAACTGCCAAAATCGGAAGTAATTTCTCTCTGTTTAAAAGGCAGTG 

AAACAAATTTTCAGAGCAGGTTTCTTCAACTGAACAAAATATTTTGGACCTTAAAGGTGG 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

37/467 



TATGGCTCTCCTGATCAGGGAGGGACAGTGAAAGGTTTGAGCTCTGACACTGNCCAGCTC 
TCTGGATACAACCAAGTGAC I I 1 1 I I I IG 
< Sequence 203 

CGCGTCCGAGTATATAGAAAAACCATTAATTTAGACTCTGTGAGATTAGGTTGCATGAAG 

AAGGTTTTCTGAATATTTGAAGAGTGGATAAATAAATGTCCCCCAAAGCAATAAAATCAT 

AATCCTTTAAAATATAGGAAAAATAACTAATGGGAACTAGGCTTAATACTCGGGATGAAA 

TAATCTGTACAACAAACTCCCATGACACATGTTTACCTATGTAGCAAACCTGCACATGTA 

CCCCTGAACTTAAAATAAAATFTAAAGTAATAATAATAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 204 

CGCGTCCGCTGGGAGGCTGTGGGGTCTGCCACCCAGCAGATCTGTGTCACGGGAGTGGCG 

CTGTCACTCGTTGAGGTGGTGGCCTGGTTCCTTTGGCCTTAGGGAAGGACAAACTTCAAC 

TCTGAGCCTTGATTGAGTGACCTTGGCCMGTTACCTAGCTTTTCTGAGCCTCACTTTTT 

TGGCNATTAGATGAACCAGAGGTTTATTTCACTCAGAATCCTGTTCACGATGCTGGTATT 

TGGACCAGCCTGCGGGTTTATCCTGGGCTCTTTCTG 

Sequence 205 

CGCGTCCGAAAAAGGATGAGAAGAGAGGTGCATTCCAGAAGACAAAAGGTGTGTAGTATC 

AGGATAAGGGGCTTTAAATATCAGATCCAGAGAACACTGCACATGTAGAAATGGGCTTGG 

CCTGGGTCAGGGCATTGAGATTGGTTACATAATCTTTTCAAGGATTGGTGAATGAGTTGG 

AGTATGTGTAGAAACCTACAAAGATGACAGTTTAATCTCATGTCATAATTTTTAGACAAA 

TAATGTATTTTAAAACTGGGTGCAGTTCCTAAAGCTGTTCTAAAAGTCAATGCAACTGAA 

TTTGGAATGTAAGCATAGGACAAACAGATGGGAAATAAGTCATGACCTCTGTGGGATAAA 

GTGAGAGTTATCAMGAATGTCAGTGTTTATAACAAGGAACAAGCTTGTTTTGGAGAATT 

ACTAGATATTATGGAAAA I I I I I I I CTTTTCTACATTTGGGTAACTATAGCTGAACTATA 

GCAGATCATATTGACTTGGCAAAAAA 

Sequence 206 

CCNCGCGTCCGGTTATGTACTTTTGGAGACTTCCATTAGAAATATTGGCAAGTCCCTGCT 

TCGTGGCCATAGATTTAAMGGCCTATCMTTTTAAATGTTTCGGTCATTGAGAGCTAAA 

ACATGTAACATATCACAGTGTTATTCACCAGAAATAAAAAATCAAGAGTCTGCTCAGAGT 

AGGTTAATATGAGTTCCTTTCTTCAGTCCAGCTGATGGI I I I I AGTAAGATGAACTGCCA 

AGGAGACMTGAGCACTGACTTCTCGATGCATGACTTCATCTTGTTAGAAGGTGGGTTGC 

CGGGCCGCGGTGGCTCACGCCCGTAATCCCAGCACTTTGGGAGGCCGAGGCGGCCGGATC 

ACCGAGGTNGGGGGGATCGAGACCA7TCTGGCTACACGGTGACACCCCGTCTCTACTAAA 

TATACAAAAAATTGCCCGGCCGTGGTGG 

Sequence 207 

CCNCGCGTCCGATGAATAGTTAGCCCATGATAAAGGAATAAAAGGATGAAGAATATTTGA 

AGAGAAATAAATCTTCCTCACTCCTCAGGTTCCCTTCCATGTGCAGGAGCCTCAACCTAC 

AACTAGCAACCTTATCTCCTGACTCATTCCTCTCTAGAGGAGGAGTAAATTAGTCAACTG 

ATATGCTCTGGAAGAAAAACCCA 

Sequence 208 

CGTCCGGTCTTTCCTCCCCCTAAATAATGCATTACAAAGTGGAAATGCAAATTTCCTGTG 

CAAGCTCTMGTAGCAGGTGGTATTTCCTTAATATATTGTTTTTGACCTTTGGGGAAATT 

GGTATTACGAGCTGACT7TGGAAAATTAAAATAGCATCAAGGTCCTACATTTTAAATAAA 

A CAATC G ATATC TTAATTTTTAAAATCAGACTAGATTACGATACCAGGAAAAGACATACA 

TATTTTGCTTTTATGTGTTAAAGTTTTGTAATTCAGGGAGGACAAGAAAAGGGATATGGT 

GCAGCTGAACTTtCTAATTCACTAAGACAGGAAAAAAAAAAAAAAAMAAN 

Sequence 209 

CG TCCGG GAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGCAGGAAAA 
CTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGGGGGTGGG 
AGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTGGTCTGCA 
AAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGACTGTATA 
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CCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCAGATTTGA 

ATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTCTACACCC 

GATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCCTTAAGTT 

ATATTTTAGAAAGATTTCTGTCTAAAATAGATAAACTTGAAAGTATAGCTCTTCAAAATA 

ACGTATTCCTGTGTTGGCAMTATTTTCCAAACTCACAATCAACACATAGGTGTATTTCT 

TAGACTACTAGAAGTGGGGACTTACCCCAA 

Sequence 210 

CGCCNCGCG TCCG GGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGC 

AGGAAMCTATTTTAAAAATTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGG 

GGGTGGGAGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTG 

GTCTGCAAAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGA . 

CTGTATACCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCA 

GATTTGAATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTC 

TACACCCGATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCC 

TTAAGTTATATTTTAGAAAGATTTCTGTCTAAAATAGATAAACTTGAAAGTATAGCTCTT 

CMAATMCGTATTCCTGTGTTGGCAMTATTTTCCAAACTCACAATCAACACATANGTG 

TATTTTCTTAGACTACTAGAAGTGGGGAC 

Sequence 21 1 

NCGCGTCCGGTTTCNTTGGGATAGATTTTACCTATGMTTCCTCCTTAGAATTCTGAAAT 

TGCTCAGATTTACCCAAATGACAGCCAGTTTCTCATTTCACATTTGGGGGCTGTAGAATC 

TTCCMCATTGAGAACCTGTTTTAATCAAAGGATGCTTTGTGGAATCCTGAATGAGGAAC 

AGCATGTTGCAGGMGAAGAGAAGGATCCTGATGCCCTAATGGGACTGATTTCCTTTTGG 

GGGGCAGGAAGATATATATTCGTTGGGTGCTTATAAAAGGTTAATTCCAAAGATTGTGTA 

TGGTTAAAGGACTGAAAGTCACACTTAGCCTCATACTTCACTTAGATGAAAAACAAAAGC 

CTTCCTCTCCATTACCTTGTAAGATCTATTCCTTGTGTCTTGTGCTGAGTGGACCTGGAA 

TAATGGATAGCCCTCACTGAGTACCTAGAAGGGGACTAGGGGTGGGTGATGAAAGGGGGT 

TCACACCGAAGATCTAAGTGCTAGCTTGGGTA 

Sequence 212 

CACGCCCCGTGGCCTTGCTAGAGATCCATATAATGCAGTCATGCTGTTTCTTNCTCCATA 

GTATGTGGGGCATGAGGAGGAGACAGGGAGAGGGTGGCTTCATTGNGCAAANGNGGAATG 

GCTGTGCTTTGGGGCCAAGGAGATGCTGTCCTGCTGTAGCTGCTCTGTGAAAGGTCAGGC 

CTGCCCCTNTGAGGCTCCCTTTATCCTCCTAAATTCTGGGGCATCTACATGACGCTTTCT 

AGTCCACCTTTGCCTNCGCAGATCATGGCTACTAACCTGACCTTTGTCTGTACTTGAGCA 

CCCTTCGCGATTTAACTTNCATGTANCGTCCGACTTCTMTATGGATTTGAATTTNTTGA 

CTGTTACTGCTCANAACAATCACCCTTTTTTGAGCAGNGAGCTGGNAGGATAATTGCCGA 

CAAATGACATTNGGANCCGTTTTNAACCACAGGGGGCATGGGG 

Sequence 213 

CGATCATTTTATTAGAGTNATGTATTTAAGAACTGATAAATCATGGGCTTACCTACACAA 

TGTCTAGACACATGAGCAATGAACAAATAGCAAGGTCTGTGATATCTCATATGGCAATAC 

TAGGACTGAGATTATTTTTGTTACAATTAAATAATTGTCAGTAAAAATCCACAGAGATCA 

TTTAGM TGGG MAAAAGTCTGATA TATTTTG TTTCAGATTATMTCATTAATAGGGTAC 

CTGACAGTTTTCAAAGTTGTTTAATGTTTTTTAGGTCTTTM 

GTGGTTACAATCCCACATATTATCCTTGTGTCAGGGGTCTCCAGCACCTGCCTTAGGGTC 

AGTGATTTGCTAGAAGTACTTACAGAACTCAG 

Sequence 214 

ATCTGACAGCCTGGAACNGCACCCACACCCCCAGGTGAGAATCTGATGTTCTGGAGCATC 

ACACACAACCACAGGTGAGCATCGGAGAGTCTGGAGCAGCACCCACAACCCAAGGTGAGC 

ATCTGACAACCTGGAGCAGCACCCACACCCCGAGGTGAGCATCTGACCTCCCGGAGCAGG 

ACCCATACCTCCAGGCGAGCATCTGAACCCATGGAGCAGCACCCACGCCCCCAGGCGAGC 

ATNTGACCGAACAGAGCAGCACCCCCTCTA 

Sequence 215 
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TCCGGGAAAGACTCTGAGAAAGGACTCCTGACCTGGCTCTTCCAGAATGAGGATGGATTT 

TCCATATGAAGGAGCCGGGGGCAGGACCTTCTGGGTAGAGCGTGAAAAGAACACAGTATG 

TGTGGAGAGCGGGAAATTGTGTTGAACTGTCTAGAAAACAGAGCCACAGGAATGCGTGTT 

GAGGGCTTGGGCTGTACAGTGGAGAGTGCCCTCCCTGGCCAGGGAGTTGGACATTCATCC 

CACCACAAGACCCCATGAAGAGTTCCTCAACAGCTCTGTGTCCTCATCAAACCTGTGTTT 

GCAGAACAGTGGAGGAAGAGAGCTGAAGGAGGGGAGAGGCCCTGCACCTGCCAGGCCCTG 

GCCTAGACTACAAGGGTGAGCACTGAGCCATGCTCTCGGGGAACCTTCACTGGAGTTGAG 

GGCAGTGAGAATGTTTAAAAAAAAAAAAAAAAA 

Sequence 21 6 

ACGCGTCCGATGCACACTTGTCTTTTACCACANGGGTGGGGCGTGGGANGGAGGTTTAGT 

TTGGATAGCCACGTAAACGCCTTTCCCTGTGGCCTGCGATGTTCCACACCGTTTATGTGT 

GAACTGGCTGCACCCGCGCCTCCCGGACGGGGCTGCCAGGGAGGAGGGCCCGGGAGACCC 

CATCCAGACCCCGGCCCGCACGCTGCAGAGGTCTGCTCTCAGACATGTGGTGGGCTCCGT 

GTCACGGGTAAGGGGTCGGGGTCTAGACGGCAACAGAGTGTCTCTCTCTTCCCGCTCCCT 

GGTGTGCCCACCTCCCTGTACAGTGTTTCTGTTCAAGCTGCTGCAGGGGACGGGGCATTT 

TTCGTCCAGACTCTATTTTCCTGCAAGGMGAGCTGCTGTCCTTTTTCTTACTGAAGCCC 

CTGATTCTGTGTCTGATGTTGCTGACCGCCGTGCTTGCTTTCTTGCCCGTGTGCAACTTC 

CAATCCCAAGCACACGTGCTCACTTCCAAG 

Sequence 217 

GCGTCGGGGGMTGGCTGTNAGTNMGTTAGAGGTAAMAMTTTCATGTTAAGATTTTG 

GMCTGGATTTTATTTAMATAATGATGCGMGCCATTGAAAGGTTTTTGGTGTGACAGG 

ATAAATT TAAAA TATGMCACACCAACGCATACTTCTTTTAAGAAAGAACCTGATTAAAT 

TTGGGAATT TTAAAATAAAAACAGGAAGCATATCGTACTCTAATATAATAATTCAAGGGT 

TTTTATTTTCCTAGAAGATCAAGGTCATGTTAATAAAGGGAATATAGTTTTCTTATCTGT 

GTTAAGACACTGATGACTTGCAAAGAAAAGTMCACTTTTGTGATATCCTTAGGTAATTC 

AAGAGGAAACGCTTGAGCAATTACTGATGTTGTAAACTGGGATCAGAAGACATA 

Sequence 21 8 

NCCCCGCGTCCGCCCAGTTGCAAAGGAGATGTTGTAGGATGTTAGGTCTCAGCACAAGGA 

ACCCAAACCTTCAGGGGCTCTCCTCTACATTATGCTCCCAI I I I I CTCCCAAAATATCGA 

TCTCCACCCACCCTAGACATAGMCTGGAGMTMGTTCCAGTTTCATCCCTTTGCAGAT 

CTTAGGGGGACCCATCAAATTCCCAGCCACTGGGTGAAAATCAGCAGCTTCTTTATAGGA 

CCTGAGTTGCCTTCCTAGAGGATCCTAGAGGAAAAAAAAAAATCTTATCCTTCAAATACT 

GCTGTCTTCCAAAATACGTAAGGACGCCACGGTGAATCATAGTGGACACCCTGCATTGGT 

TGGGTTATTATTTATCCTAGMGCTTGGGTTCTTGGAGCCCTAGCTTATTTAAGCAACAA 

AGTCCCTCACAGCCCACAGGTGAGGAAGTGAGTGTAACAAAGAGATCATtGGACCTAAAA 

TCAAAACACCTG 

Sequence 21 9 

CCCGAAAGGAGGTTTGTGGAACTTGGAGAGATCCAGGAGGTAACACCAAAAAGCTGCATT 

TAGCMTGCCTGCCTAGCCCTCCTGTACAGCTCATGTTATTTTGTTGCCTTAGGCCATCT 

TACACCMACCA1TCCTATCTCCATGCTTTTGTTTATGCTGTTGCCTCCTTTTAGAAGGG 

CCACGCTCCACCGCTCTGCTGTGTTGATACTACCGACCCTTCCTTTAGTTCCTGTTCAAT 

TCCCAAGCCTTCTGCCAAGCCTTCTTGGACATTCCCATCCCATGCTGACTATTCCTTAAG 

CTAAAMCCCTTAGAGTTAATATGATACACTTGGCTACCTCATTGTTTCATATCAGTCAT 

TCCTGTCTCTAGCTATAATCGGCTCAGCCAAGGAAGATATATTTATATGGACAATGTCTT 

TGTGCCTTGGCTGTAMCAGTGTTGAATAATTAATCTCACTTGATGAGGTCTTACTTAAT 

GATAGCCTCC 

Sequence 220 

CTCGTCCGGGCTGGATCCGTCTGCNCCACTGCAAGGGCAAGATGCAGCTGGTGGCTGACC 
TGCTGCTGCTGTCGAGCGAGGCGCGGCCCGTGCTCTTCGAGGGCCCCGCCTCCTCTGGTG 
CCGGCGCCGAGTCCTTCGAGCAGTGCCGGGACACCATCATCGCGCGCACCMGGGGGTTT 
TCATCCTTACCCACGACGTGCAGAGCCAGCTCAACATGGGCCGCTTCGGGGAGGCGGGGG 
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ACAGCCTGGTGGAGCTGGGCGACCTGGTGGTGTCGCTGACCGAGTGCTCGGCCCACGCGG 

CCTATCTGGCCGCTGTGGCCACGCCGGGCGCCCAGCCCGCGCAAGN 

Sequence 221 

GATTTTGGCTNCTGCAACCTCCGCCTTAGGGGTTCAATGCAATTCTCCTGCCTCAGCCTC 

CAGATTGGGATTACAGGCATGTGCCACCGTGTCTGGCTAATTTTCTTGTAATTTTAGTAC 

AAAAGGGGTTTCACCACATGGGCCAGACTGGTCTCAAACTCCTGGCCTAAAGCGGATCCA 

CCTGCCTCGACCTCACAAGGTGCTGCGATTACAGGGATTACAGGCTGGGATTACAGGCAT 

AAGTCACCACACCCAGCCTAAAAATTAAAAGTTTATACTGTATTGTTCACATTGTGATGC 

AACTTTCCATATGTATTTCCAGAATCAACTATGTATCAAGGAATATTGAAAGCATAAAAT 

GAGATCATGGTGAATTCTCTCCCATTGTCATTCTTGNGGTAAGGGAATGAATGGTGG 

Sequence 222 

CNCGCGTCCGAGTTATTNATATAAAGAACATTTTTCTGTTTTAGAGAGAACCCATTTATT 

TGTGAAAGAAACAGAGTTTTGTTCTTATCCCTATAAACACTGCATTTGCTGTTTCTTCTC 

TAGCTGATGTGACATATTAGGGAAGCAGCACCCGACTGGGACTCAAAAGACCTGGGTTTG 

GAAACTTTTTGATTACTAGCTTAGCTGTGTGAAATCAGGCAGATGGTATAACTTCTCTCA 

TTGATGATGTCAATCTTTTTAAAAATTTTCTAGCAGTGAAATTTTAAAAAATGAAATGTT 

AGGTGAAACCTCAAACTTACAACAGTGGTAAAATTGGAAGGGTTCTGGATGGAGTGAGGG 

CAGGGAGGGGGTAACTGGGCTGAGGGGCCCCTCTTCAGCCTCTTCCTCCCCTTCAACCGA 

AGTCTGAAAATCTCTGGCTMTATGGGTGGATCCCTMGATTTCTTTTGTACTTTTAACA 

TACCGAAAAGNTATTCTAAAGAATTTTGGTATGGTTTT 

Sequence 223 

CGCNTCCGCCGAGCGCAGCAGTCTCCCCCTAACCTCAAAAGCCTCCTCAGAAGTAACCTG 

GTATCAGTGGGCTGTGCAGATTCTTATAGTCTTCTTTGCCTTTATATATGAAAATAACTT 

TGTTTTATGTTTGTTTTCACATAGGTGGAATTATACTGACTCTTACTCTGTGACTTGCTT 

TCTTCACATGACAGTAACTCCTGGACTGTTTCTGTAGCAGCCACACAAGTCTCCTTTATA 

TTGCCAATATCTATCTTTATCCCAAAGACAAGATGAATGTTCAAAAACGGTAATCCAAAC 

TCACTTCTAAAAATGGGTGTGTGTTTAAAGAAGAGCTGCTCAGCTGAGGCAGTTTTGCTG 

CCGAGGGGATTTGAGCAATGTCTGGGGACGTGTGGTTGTCACAACTTGTGAGGGGGCTC 

Sequence 224 

GCNTCCGTCAATTTGCATCAGTTGCCTTAAAGAATGGGGTAGTTATAGGAAGCTCACAGA 

GAGGMAAAMCTCTTCATGTCTACATTGCTTCTTGMCTGATTTTATTTATAAAAGAAG 

AATTGTTGAGCTAGTGATAGAAGTTTCATGAATTCCCTGGTACTAATTATCAGTTAAATG 

ACCTCTCTGAAGTCTCTGGAGAGCGTTCTGTGTCACTAATATTGGTGAAAATTTGAAACA 

AAMTGCTCTCCCATTGTCCACATATTGCTTCTTTGMTTTGGTTTTTCGAGCCAAGAAC 

TTAGGGTGTGAGAATATGTTTGTGGGGAAACCCACACAAATTTTATGTTAGTCTCTGTAC 

ATTTAAATTTTTACCTTCCTGATTACTTACGTAAGACTAAACAATTTAAGTTTCTAAAAT 

GCCATCACTTTTGCAAATAAAGGACTTTATTAAAGNTGATAAATAACAATNATGGGCCAT 

CAGCTCCACCTATAATTAATATCCTTGCCTGGCACCCCTGGAAGGGACTTAGCTTNTT 

Sequence 225 

CNTCCGCAATTACTGCCTTAGGCTAGATTCCCATGAATANGAATTGCTGAGTCAAAAGGT 

TGACACATTNTTTMAGGTTAGATACATATTAGTCMAGTTTTTAAGCAANATGCTTTCT 

MGCCTNTTTGATCTTTATAANNCATTGNTCCTTTCTAAAAATATATAACTGTCTTCTGC 

GTNCCMGGATMTTNTTTTTATTAATATGGGGCTTCTTGTGTCTACTTCCTCCCCTTTC 

GTTATTTCTTCAAAATGTTTAACAAACTAATACATTTCAGAACACATTATGCTTNCATCT 

TGTCACTATTTGCAGTACCTTGTATCTCCTGGACTTTATGCAATGCGTGTGTGTGCACAG 

AtGGAATATGTTNCACCATTTGGCT 

Sequence 226 

GCAGTGCCGGCGGCGCAGTTGGGAATGGGAGTGCGCTTGCACAAGCGCCGACGAGCGATA 
GCGACGCTATGCCCTCTCTTGCCCAGGCGCCGGGTGGCGCGGATCAACCTGCTCTCGCAG 
CAGCCCCTGTGGTGGCCGCCGGTGATGATGCAGGAATCGTATTTCGACGGTGGNCAGCTT 
CGCTTTCGATCAACATCTTCGNCGAAGTTCCGCGTGAACCGTNCGGNATCGACATCGTGG 
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CGCTGGCATCTGCCGCTATACCAATCATCGAAACCGCGCTGGAAGGTCGGGTCCAGCGCC 
CTGCAGAAGAGACGCCTCCAGCAGGACAAGGTGGCCTGCTCGACGAGCAGGCTACCCGAG 
TGTCGTTGAACAAGTATCGCCTTGGGTCT 
Sequence 227 

CTGACACCCAGTATTGCCMCCTMGGACTAGGMGAAAGGGCTTACATGTTTCTTGCTT 

TMTAATATGTGMTTAGAAMGGTATTCAAGAAGATACAGGGTTAGAATGTGCATCTTT 

CTTTTATGTACTGAAATGTTTCAACTCATATGCAGCAATGTGGTNACCTTGAAAGATGAT 

TCCAGAGATTTCATGCTTCTGGCAGTTTCAGGTCACATTGGGGTGAAATTTGACATACTG 

TAGTCATCCCCAAGGTAATAATGTTGATGAAATGGTATAAACCCTGATACATTCTTAAGC 

AAAATGAATCTAAAAGTTTGTTTCAAAA I I I I I AACGTTTTATGTTGCTCTGACTTTCCA 

TATACTGATTTTTACATTACTTAGTGAAAAAAAAAAATTACCTTTTAGCTTNTGGCAACA 

AAAACATTTTTGG CTATT 

Sequence 228 

CCNCGCGTCCGCMGACAGGAGATTTCTTAAGGTCATATATGCACGATTTGCTTAAACGT 

CATATAGCATAACTGGAAGAATATAGAATCTACCCAAAGCCTATGCTCTTTCTACTGNAT 

TATTTGCTTATAATAGAGAATGCTAATCAGAATCACCTGAATAGTCTTGTAGAGGGTCAG 

TACACAAGCCTAGGTCCTATCCTTCAGAGAGGTGAAATGGAGGCAAATATATTCTGAAAG 

AGTTTCCCAGTTGATTCTGATGAGCACTCACAATTAAGAAGTGCTGCATTAAGAGCTGTA 

CATGGTGGCTTATGCCTGTAATCCCAGCTATTCCTGAGGCTGAGGCTGGAAGATCACTTG 

AGCCTGGAAGATCACTTGAGCCTGGGAGTTGGAGACCAGCCTCGGCAACGTAGTGAGACC 

ATGTCl I I I I I I 

Sequence 229 

TTGGGAGATATGGCANGGTGAGMTGTTTGGGAAAGGAAGTAGAGACAGACATTGGATTT 

TTGTCAGTCAANTTTCTTTTGATTTTAAATTATTTATTCATTACATTGTC 

TTMTATTCTAAMCTATMACMAATTTAATATAGAAAAAATATTATGCTTTTCTTT^ 

TGCCTTTGCATGCTCTTATTTCAAMTAGATTTAAAATTCATGGCATATTATACTAGAAA 

ACAAGTCTGTCAATGATCTAAGCTTCTATCTTTAGATACTAGGAAAAAAGAGAAAAACCC 

AAAGCAAGCAGAAGCAAGGAAATAAGAACAGAAATAAATGGAATTGANAAGAACACTTCA 

GAAAATCAGTCAAACTGAAGCTGATTTTTATAAAGATTA 

Sequence 230 

CCACGCNTCCGAGGAAACTGGCAGGGAAACAAAGTATCCCTGGAAGGAAT7TCTTAAAGA 

GGAAGAGGCAGATCCCTACAAGTTTAAAATCAAAGAATTTGAAGATGTTGATCCCAAAGT- 

GAAATTGAAAGATGGACTTGTGAGGAAGGAGAAAGAGAAGCATAAAGATAAGAAGAAAGA 

TAGAGAGAAAGGCAAGAAAGATAAAGATAAGAGAGAGAAAGAAAAAGTGANAGATAAAGG 

CAGAGAAGATAAGATGAAAGCCCCAGCACCCCCACTGGTGTTGCCCCCAAAAGAGTTGGC 

CCTGCCCTTGTTCAGCCCTGCCACAGCCTCCAGGGTCCCAGCCATGCTGCCATCTTTGTT 

GCCAGTGCTTCCGGAAAAACTGTTTGAGGAGAAAGAGAAGCCGAAGGAGAAAG 

Sequence 231 

NCTAGACTCCCCTCTCGTATCATGGATCCCAACATCNAGGNATATGGNCATTTACGTGTT 

GGGATCTGCTCTGCCATTGNACACAGCTATATTCNATTGCCCCGGGNGTTGTGTATNTTT 

CCAMAACGTTGAMGGGAGGTTCAGMGTATNCAGTTATTNGTATTATTAGTCGTTTTG 

AAACTGAGTNGAAAGACTCATTNANGAAAGNTCCATATGCCTTCTTGTCTGTCTATGGCT 

GGNNTGCTCNNGAGAAAAGTCCNCANTTATACAATTGTA 

Sequence 232 

TTTCCTTTTTrTGGGGGCGGGGTCGCGCTCTGTCGCCCAGGCTGGGGTGCAGGTGGCGCG 

ATCATGGCTCATTGCATCCTCMATGCCTGGGCTCMGCMACATCAGTTTTCTTATCTG 

TGAAATGAGGATAAAAATGTCTCCACTTAAGGGTTGTTGCAAGGAAGGTGTTGCCTTAGT 

CATAAAAGCTAGGGAAGGTGTTCCTAACGAAAAACAATTCGTCAGAGACATGAAGGTAGA 

GGAAGAATTCACACATGAAGGGGGCTGGGGAAAATGATTTAAGAAAAGAAACAGGCCTGG 

CGCAGTTGCTCAGGCTTGTGATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGGATCACCT 

GAGGTCGGGGAGTTCTAGACCAGCCTGGCCAACCGTGGTGAAACCTTGTCTTCTACTAAA 
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AATACAAAAANATTGGCGGGGTGTGGTGGCAGGTGCCTGTATTCCCAGCTACTTTGGAGG 
CTGGGACGGGATAATTNCTTGAGCCCCGGAGGCGGGGGTTTCCGGTGAGCCCGGGATTGC 
GCCCCTTGCACTACAAGCCTGGGGCACANAAGCGAGGACTNTGTCAAAAAAAAA 
Sequence 233 

CCACGCNTCCGGTCTCAAATCNCCTCAAGTATATTCAAAATTTGACCACTTCTCACCAGC 

ACCACTGTCATTATCCTGATTCAAGCCTCCATCATCTCTCATCGTTACTGTGACCTCCTG 

ATCATTCTCCTTGCTTCAGCCCTGGCCCCTGCAGGCAGCATTCAGTATTAAGCAGTAGAG 

TTGTTTCTATAAAAATGTAGTCAGCTGGGTGTGGTGTCTCACGCTGTAATCCCAGCACTT 

TGGGAGGCCAAGGTGGGAGGATCACTTGAGCTCAGTTTGAAGACCAGCCTGGTCAACATG 

GTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACTGGGCATGGTGGCGGGCACCTGT 

AATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAAGGCT 

TGCAGTGAACCCAGATCACACTACTGNTTTTNCAACCTGGGCACAGAGTGAGACTGCCTC 

AAAAAAAAAAAAAAAAAAA 

Sequence 234 

TCGAGGTCAAGGACGGTTATGGCCCGTAACTACCTGCTGCCGCAGAATTACGCCATCAAG 

TGGACGCGCGGTGCTGAGGCCCAGATCAAGGACATCACTCGCGCCCGTAAGGCTAAGGAG 

ATCAAGTCCAAGGAGGAGGCTGAGCAGATCCGCTCGCAGCTGGAGCACCTGGTCGTCCAG 

GTGACTGTCCAGGTCGGCGAGAACGGTCGTCTGTTCGGGGCCGTGACTCCTGGCGATATT 

GCGCTGGCAGTCAGGAAGGCCGGTGGCCCCGCCCTCGAGAAGCGGTCCATCGAGATCACC 

AAGCCGATCAAGACCATCGGCAAGCACACTGTCCGGCGTCAAGCTGCATGACGCTATTAA 

GGGTCACGTCACGGTCGAGACTGTTCCCGCCGCGTTGATTTGACGTACACGCAGANTAGG 

GGGAGGGGCATCCAACCTGGGTGCCCTTCCTTTGCTTNCGTCACCACCGGCGAAAGGTAA 

ATGACCGAAAGTAATTCTTATTACNGTGCTTAGGGGGGTTTCGCCGCGCCGGGCCGTTAG 

ACTTAGTCTAGAAGAAAAAACCTTCCACACCTTCCCCTTGAACCTGGAAACATTAAAATG 

AATGCNATTGGTGGGGTGGTAAACTTGG 

Sequence 235 

GNGGTAGCTTTGTGTATGTCGGGCACTTCNAGGAATAGGGTGCAGGAGAAACGTCTCAGT 

GTCTCCCCTTCCGAATCTTGGCTTCTGGAGGGAGAGATGCTGGGGTGGGAGTGCTCCTTG 

GTGGAGTACTCAGGAGCTTAGTAAAAGCAGAGGGGGCTGGAGAGGCAGGCCTGGCCTGCA 

GAGCCAGCATGGAGAAGCCTGGTGTAGGGCTCTCCAGCCTGCCAATTTACAGTTAAGAAG 

AAAGGAGATATGTATATATATATATACACACACATACATACATACATACACACACACATA 

TATACACACATATATATACACACATGTATACATATATATACACACACATACACATATACA 

CACACATGTACACATATACACACATATATACGTGGGGTGTGTATATGTATATATGTGTAT 

ACGTATATA 

Sequence 236 

AACTGTCTAATCTCTTGACCTCGTGATCCCCTGCCTCGGCCTCCCAAAGTGCTGGGATTG 

CAGGCATGAGCCACTGTGCCCAGCCATAC 1 1 I I I I I I I I II I I CTGGNGCATTTCTGAAG 

TTAMTATGTTGAGTTCTCTGCCCATTTGTTCAAATTCTATTGNATTATTTCTGNNGGAT 

TGACAAATGTTATGAACAATTTGGTTTCAAAATAGA I I I I I CTATTTAGCAACTTTCTAT 

TCCTCTAGGACCTAGTATTAAGGTCATTAAGATGGCTTAATGAGTCATTAAGCCATTAAT 

GAGTCATTAAGATGGCTGACCAATCTGTGCCATCI I I I I I AGTGACATTTTGTTTATCAG 

TGTCTCTTTTGTGTATTTCTTTGATTTATATCTTGACAGTATTACATAAGCAGGAATAAA 

AGAGACTTTGAGTGGGAATGTCTCGCTCAAATATAAATAACCTTATAAATACAATCTTTA 

CTTCTCAGAGTCCTAAAATCTTCTTGGTTAGTCTGGTCTTCCTGGGTTGATTCAAGGTTC 

TCGGNGACTAAATGGCAT 

Sequence 237 

TGTTNACCTTTGGGATTACACAATACTTGCAATCCAGCTCTGCCATGGGGGACATCATTA 

CACAA AGCACTCCTCACCAGTAGTCAGATGGCACTTGATAGCTACAGGCATGTGAATATT 

CTTTTACTGTCAACTTTTGTTGTGATTTCACCTAAAATATGATAAGCTCCTTGAGGATAA 

AGGCTAAGTCTTACTTCTTCGTATTTCTGGTAACCGCTTGTACCAAACACCTGCCAAGCA 

TTGTTGATTGCACCTATGAGGAGGTGAGAAAGTGCCAGGCCGTCATGCCCCTCTATACAA 
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ACATGTAACAGTAGACTCTGGTTCATCAAGGAGTAAATGACATGATATGTGATAGACCAC 
ATACATTCGTGGCTATAGATAGGGTGAGTAACTGTAACCATATCCAATTGTGAGCAAAAA 
GTGTTTAATTGGGGNGGTTTTMACTMTATAAMTTGGCTAAATCATCATTTTTCAAGC 
TCTCGAATCATCTGAAGACC I I I 1 1 1 IAI I I 1 1 I AANANTTAGGCC 
Sequence 238 

CGCTCCGTTGCAGTGAGCTGAGATCACGCCACTGCACTCCATCCTTGGTGACAGAGCANG 

ACTCCATCTCAAAAAAAAAAAGAAATTGGGTTGTCTAGGCACAGTGGCTCGTGCCTGTAA 

TCCCAGCATTTTGGGAGGCTNCNTAGAANAATCACTTGAGCCCAGGAGTTTGAGACCAGC 

CTGNACAACATAGTGAGACCCCATCTGTACCAAAAAAAAAAACACCAGACCTGGTGGCAC 

ATGTTTGTAGTCCCAGCTACTCAAGGGCTGAGGTGGGAGGGATCACTTGNNCCCAAGAAG 

TCAAGGCTTCAGTGAGCCATGATTGGTGCCACTTGACTACAAGCTGGCAACAGANCCAAG 

ACCCTNTTTAAAAAAAAAAAAAAAAAAAAATNGGNTTGGGTTATAGGNGAAGTTNCCTGG 

AAAACCTTACAAAACAAGAACCATGTTTGGGGGAANCAANTGGANGGGGTNAAANTGNCC 

NCTTGANGCTTNTCCTTTNGNGGNCCNTAATTAAAAANGTTTCCCNGGTNTTGGCCCNGG 

GNCCGGAAAATNCCTTNGAATTTCTTTAGCCCCTTTTGGGGNGGGCAAAAGCCAGGGNGG 

AATCCCNCNNGGGNTCAGNGATTTTNGNAACCNNACCTTGGCCAAAAATNGGGGAAAACC 

CCNGI I 1 1 I I I INTAAAAAATAACNAAAAANTTTNCCCCCTTTNCTTTNGGGGGGGCCCA 

CANNNNCCCCCTTTTTTNATANANAANAANAANCCCCCCCCCCCCCCCCCC 

Sequence 239 

CATCTCAAAMAMACTCACAACATATTTTATGTACACCTAGTATCAGTGAGATTTTTTA 

ATAATTAGATTTATTATC7TCCATATTGNNTACAAAACTTTC7TCAACTTCATAAAAAGT 

CAAGTAACATACTGTACTTACTTCTCTCACCTCATTTTAAAATAGTGGATAAATTGTCAG 

ACACAGTGACTCTGCACCTGTAATCCTAGCTACTTGGGAGGCTAAGGTGGGAGGATTGCA 

TGAGGCCAAGAGTTCAAGACCAGCCTGAGCAACACATAGAGACCCTATCTCTTTAAAAAA 

AAAAAAAAAAAAAAA 

Sequence 240 

TGTCGACCCACGCGTCCGCGCTCCTGTCATCTCCCTTGGGTCTTCATTTAAATGCCACAC 

CAGAGAGGCCCTCCCTGGCCACCCTAATGAAAACTTCAACATCCTCAACCCTAACATTTC 

CTGTCCCCTGGGTTATTCCTCCCCTTGGTATTTATCACCATTTAATGTACTATCTGGCCG 

GGCATGGTGGCTCGTGCCTGTAATCCCAGCACTTTGGGAGGCCCAGGCGGGCGGATCACC 

TGAGGTTAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAA 

TACAAAAATTAGCCAGGCATAGAGGCATGCCGCCTGTAATCCCAGCTACTTGGGAGGCTG 

AGGCAGGAGMTCTCTTGAAACCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCAAGCCAC 

TGCACTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAACAAAAACAAACAAACA 

AACAAACAAACAAACAAAAAAACCCAACAAACCCACCAAACATACCATATTTGGTTTCTG 

TAAMAMTAAMMGAGAGAGAGMTTTTTAAAAAACAAAAACCATACTATCTAATTTA 

CAI I I I IAATCT 

Sequence 241 

CCCCGCGTCCGCTAAGGCATGTGAGCGCCTTGTAGGGCACG7TCTGTCTTCTGACTACAT 

MGCMCACTTTTGCGACCCTAAGTCACAGCACCCACAGCTTTTGGCAGGATACTTTAAA 

ACAGAAAAAACACATCATTGATCCTGGCAGGATTTTTTAAGGGATTGTGCTTAAGAATGT 

TTMCTTTGGTMTCAGAGACCACCACTGGTGTTTTTCTCCAAAATCACATCTTTAAGTT 

AATTAATACTTGAAGTTAATAAATAATACTGCTCAAGTGTATTAGTAATGATGCCATAAT 

ACC ATGT GAATTTATGCTGATTCAAATGTTG I I MM I CGTTTGATACTCATATGGCCTT 

CTGTTTTGAGGACTTCAGATTATTTAGCAACTGATTTAATCTGGTCAAGAAATAAACTTT 

GCTTCAGCTGGAAAGCGTGAGGCTTGAGAAAAGCAAGGTTTTGGACAGGGGACCTATGAA 

GCTCATGTTGAMCTTAAGTGTTTTAAGGCTGTATGGGAACTTGGAATGGGAGTGAAAAG 

AACCAAGCCGTCTACTGNCAAGGTTTTTCCCTCCTCCCTCTAAAATTTATAAACCTCATT 

CTTAGAAGTGGCAAAAAGTTGGGAACCTTTTCCACTGNTTCACTTCTNTTANTNAGGGGA 

TTAAAGGNGGATNGGNMANGGAAGTTTTCNTGGTTTTTTAAAAAAAANNGNANAANGGG 

GGGG 
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Sequence 242 

GNCACGGCGGGGCAACCTGGTTCCCCTGGGCGGTAGTCCCGGCAGCTGTGCTTCCCCCTT 

CACTGCAGTGCGTGTTCAGTTGGAAACACGGAGATTTCACCTTTACTCCCTGGCTATAGG 

GGGATGGTGAGAANGGGGCTGCGTGTNAGGNGANATTCAAGGCATAAATCTTAGNTTGTC 

TTTTAAAGAATTGNGTGCCGGGACGTGNTTGGAGGNTGGGTGCANANAGGAAGAGGCGAN 

CTNCTGGAGGANGCAAAGGACAGGTCANTNTGCGNANTTCNGNNNGATGCTGCTGANTGA 

ACCCCNGNTGCTGTNGCCCCAACAAGAAAGGCTNNGCTNNGATCAAACATTTACTGNACC 

TANAAGGCCACATTCTNGCTTAANNANNCCAGGAANGCNCNTTCTTGTTNCCCATTACCA 

AAATCATTNACACTTGGTTTTTGGCCCNACATACCTTTCCTTTTNGACCANNGTTCTGGN 

NNAAANGGCANTANACCNNNGNATNTNNCNTCATNACANGAATAAAAAGCCCNCAGAACN 

CNACANANGNGCCCATCACTANNNNTANAGATNTCAAAAGGGGGCCCNGGATCCTTNACC 

ANAGAANACTCTGGTCCNNAGNGNAAGAAAGNAAAACCCCNCNANANGCACTGGGACNNT 

NTNATANTTCTNNTTNAAACAAAANCTTATAANNNAAAAAAAAAATTTTT 

Sequence 243 

AATTAATAAGCCTCCTGGGTGATTTGGCTGCAGGTGGTCCTGAGACCACACTTTAAGAAA 

CACTAATCAAGAATCAAGGCCTTATGACTCTTAGTCCTGTTTCTTCCCAAAGGACTATGT 

TGCCTCCCTACATTCGAAAACATTCCAAACAATACATAAGAGTCTTTGTAATTCACATTA 

GTGCTCGTATTTACTGACTCMCMTTMTTCCTGATTGAGTCATTCATTGAGTAATTTT 

CTTTTACTCACTCAGCCTCTGTGGTTTGCCAGGAGCTACAAATAAAAGATTAAAACACAG 

TCACACCTCCAAGGAACTCATATAATAGTAACAGAAGTAACTACCATTTCTTGAGCACTT 

TATTTATACATACCGTCATACAGTTTATCTCATTCACCTATCTCATTCCCTGCCTTTCAG 

TAAGCCTGTATACTATTATAACCGCATAGTTTGGGAATATTTCATATATATATATATATA 

TATATATATATATATATATATATATATGTATAATCACATATATTCTCTTCAACTTGTNCT 

TTACATTTAATATGTCTTAAAAGTATTTCTTTGTCAGTTCTTGCTTTATATATGTCTAGT 

ATTTCTTTATAAGGGCTACAAGAAGTATCCCTCAAGCATTGGATATTTCAATCAAATTAC 

CCTTGGGGGGGTNGGGGGAAAAAAATNNGGGGGGGNTTT 

Sequence 244 

TCGCCCCGCGTCCGGGGAAGGCTAAGGCGCGAGGATCCCTTGAGCCCAGGAGTTCTAGGC 

TGCAGTGAGCTATGATCACGCCACTGCACACCAGTATGGGCAACAGAGCGAGATCCCATC 

TCTAMCAAATTTAAAACAAACGAACAAATGAAAAATATTGCTGTCTTAAGGTTGGGMG 

GGGCAGAGACCCCTTTGCTTGCTCATCACCAAGACACTTCTGTGAGGCCCCAGGGCTCTT 

TGGAGAACGTTTTGAAAATCACGGTTCTAAGTAATTATAGTTACTGTGACTGAACTAATT 

TAGCCCTAAGCTTCCTACAATCAAGATAGAGATACACTATGGACTGCATTTCTCCGCTTC 

AGATTAAAAAAAAAAAGTTTAAGTCAGAATGTAGTTATATTTTCAGGTAAAATGCTCAAT 

ACATTTTCAAGATGAAGCTGCTCAAAATTAAAGCAGTGAGTCCAAGGGTTAATCTGNAAA 

AAAAAGTACAATTTACTATCTCCTGGTTNCANACTTATAGACCCTCATAGGTGCATTGGC 

TAATACAAGGGGCCACTAAACACATTGTGGCATTTACNGGATTTTATTGGTGAAGNGCTC 

T ATAAGTTTT ATTGGTGCCAGGTAAAAGAAANGCCTCNTATAAAAAAATGGTGGNGGGGG 

Gi 1 1 1 1 1 t 1 1 Nccccccrn i r 

Sequence 245 

TTCCGTACCTAAGAATTGTACCCTTTCATAACAGCACCACTTGGATATGTAGAAAGAGTT 

TGTTGTGAGATCAGATGTAAAACAAATMAAGTTATTCGTGAAATGATATGGAAGACTGG 

GATTAGAAACTGTGGCATTCAAGAAGCCAGTTAAGCTGTTCTCAGAATTGACAGAGATTC 

TAGAGATGGTGTAGTGCAGATGGTGTAGTGCAGGGATTCTCAGCCTCACTGCACATGGTA 

ATCTCATAGGAAAATTTTAACAAGGACAGATGCAGAAATTCTTATTAAACTGTCAGGACT 

GAAGTCCAGAGGTCACTATATTTTAAAAAGGTCTGCAAGTGATTGTAACATGCATCTATG 

GAGGAAAACATCANCGTAGGAGAAAAAGGGAAAGAAACCAACTGGAGTAAAGGCTCTGTC 

TTGAGCACTGTGCTGGCTCCTGTCATTCTCCATCTCTTTTTATCTTCATAGTAAGTGAGA 

TGGGTTTGACAAATAGGGA 

Sequence 246 

GCGTCCGGNGTAACTTGAACNAAAGTATTCTCCTTCTTCTGTATATTTGTTCTCAACCCC 



• 
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CAATTCCTCATAACTTTCTAAGTAAGTAGGGTGAAAAGAAAGTCAGGGCTGCCTAGGGAA 

GCTACATTTGCTTCTCAAGGCTTGCACCTGCATATTTCATGCCCGACATGTTGCTGAACA 

CAGCCGAGTAAAGTGCCAGAATAAAATCAGCTCTGTCCACGTGGTACCAGGAATGCTGCT 

GAAATGAAGGTAAGTATAGGATTCCACAGGATCTAAGACAGTCCAGGGAACTGGGACCAA 

TTGTGGGAAACCTCAAAGCTGAGCATATTTGAAGAAAATGCAGAGTAAAATGGCTATGGG 

CTGCCGGGAGGCCTGGATTTGAGATACCAAGCACCACTGATATAAACACTGATGCTCAGA 

AAMGMATGATGAGTGTGTATTTGTGTCCCTTGAGACCTMTTTAATTTATAATTATTC 

ACCAGAGATAAGAGGCAAGGGATGTCTACTTGCTGGATATCTGAATTTACACAGTTCTCT 

AGTC ATTAGTA I I I I I I ACAAATATACCACTCAGACTAGGGGGGCAGAGGATCTGGGGTT 

TNMTTTCAAGACCTGTCACTTATCTGCATGAATTTTGNAGCTTATAAGAACTTACAGGT 

TAACCI I II 1 1 

Sequence 247 

TGTCGACCCCGCGTCCNTNGTATTTATAAANATAATNCTGNTAGATAAATAAGTGATTCA 

TATTTTGTCAAANCTATTTTMAATTTCAATATTTAAAATATTNTTGAATCACTGGGTGT 

CGNTAAGTGGCATCATNNATGAGATTTGATTCCATGTACCATATAATNTTAGATTGGTCC 

TNTCTCACCCCTTTTAAACTCCTTCMGCATTGCTATTACTGGGGTTGCCTTTGGGAAAA 

CTTACTTCTAGATACTACCATATATCTGAAATAGTAGAGGTGGATGTTAATAAAATTCAT 

AAAATNATCATGTATTACTTTTTTTGATTTACCACTGGAAGGAAATACAGNCATGTGCAA 

TATAATGACCGTTTTGGTCATNGAGACCCACATGTGTGACAGTGGTCCCATAAGGATGNG 

GCTGAAAANNTCCTGTTGCNGCCTAGTGACACTGTAGCCATNGNAACNCCATAGCACGAC 

ACGTNACTCACCTNTTCATGGTGATGCTGGTGT 

Sequence 248 

CCCCGCGTCCGATTTTGAATGTATTGAGACTAAAGTTCCTTTGAAACATTAAAGAAGATC 
. CATCAAACTGGCAACTGTATATAAGGTCTGATATCTTCTGGCACTCAGAGGGAGAGTTTC 
TGGTGGAGGTGGAAGTGACTTAGGAGTCATCCATGAACATAAATGAGATCACTCGTGTAG 
GGAAGAGTTAATGAGATTAGAGAGCCTTGGACAGAGCCTTGCAGCCCAAACCCATGAAAG 
AGCAGTAATAGTGAGTGAATTACCTTCTGAGGCAGTGCAAAGTAGATAAGAAAACAAAAN 
CTNNAAAAAAAAAAAANAANAAAAAAAAA 
Sequence 249 

CCGCGTCCGCAGTGGTNGTGATCTCGTCTCCTGCAACCTCTGCCTCCCAGGTTCAAGCAA 

TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGATTATGTCTTGTGTTANNGAAA 

TCATTCAATTTTTCACCAGAGAATAAGGAAAAGCAAATTAGACTGTGAATAAAATCTCCA 

TGTACCCACTAGGATGACTACAATTCAAAAGGCTGACCTATCCAGGTGAAGATGAGAGTG , 

TGCAGCATTGGAAATTCTTACACATTGCTGGAGATTTAGAAGCTAAAGAAAGAGGACAAA 

TGATTTGGAAGCAAAGATAAGAAGGAAGAAGGAGATGGGGTCTTGCCATGTTTCCCAGGC 

TGGTCTCAAACTCCTAGATTCAAGTGATCCACCTGCGTTGGCCTCCCAAAGTGCTGGCAT 

TATAGGCATGAGGCACCATGCCAGGCCTGTTTTGAAATTTCATACATATTTATAAAACAT 

ATTTTCAGGATGAGAGAATATACCAAGAAGTTACATAGCATTGTGTATCTGTATAGAATA 

AAA 

Sequence 250 

TCGACCACGCGTCCGGGTGAACGTGGTCACCAAGGCCATGGGTACCCTGGGGGTGAGCTT 
ATCCTCCTGCAGCGTCCCTGGTTCCAAACCCACCTTCGAGCTCTCAGCCGACGAGGTGGA 
GCTGGGCCTGGGGATCCACGGGGAAGCTGGTGTGCGCCGGATAAAGATGGCAACCGCCGA 
TGAGATTGTGAAACTCATGCTCGACCACATGACAAACACCACCAACGCGTCCCATGTGCC 
TGTGCAGCCCGGCTCCTCAGTTGTGATGATGGTCAACAACCTGGGTGGCCTGTCATTCCT 
GGAACTGGGCATCATAGCCGACGCTACCGTCCGNTCCCTGGAGGGCCGCGGGGTGAAGAT 
. TGCCCGTGCCCTGGTGGGCACCTTCATGTCAGCACTGGAGATGCCTGGCATTTCTCTCAC 
CCTCCTGCTGGTGGATGAGCCTCTCCTGAAACTGATAGATGCTTGAAACCACTGCAGCAG 
CCTGGCCTAACGTGGCTTGCAGTCTTCATTACTGGGCGGAAAGCGGAGCCGGGTAAGCCC 
CTTGCCGAGCCCCAAGAAG 
Sequence 251 
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GTCGCCCCGCGTCCGCAAGAAATGATGTTAGTAAAGATACTTTAATTGCGGGGAGTTCTC 

TGTTGGCTGGTGAATAAGGACGTTCTCTTTGAGTTCCACCTTTCTTTGAGAAATTTTCAT 

GTTTCCTGTACTTTTCCTCTATTATATCATGGAAACTTACTAAATCAGCCTGTGCATTCC 

GCCCACATATTCCCTCATGGTGCCTTCCGGTCCTAGCAGCTGAGCCAGGGCCATCAATAA 

ATCTTCTTCTTTTCCCCCAAGATATTCCTAGTTGCTTCTTTACTATTCATTCTTGTAATT 

CCATTCATCTGTGTGTATCATTTAGGATGCTGGGCAGTTGCATCGCATTTCACCTAAGGT 

CACATCCTGATGGGATTTGGTGGGATCCTAGAGCTGGCTAGGACCACTTGTGAGAGCTGA 

TTGTTAAATTCTCCGAAATGACGTGACTCAAGTTGTTAGACTGCTTAAAGG 

Sequence 252 

CCGGTGAAACCTGGAGCTGAAGTGAATTCTCTTAGAGTATATTTTGAAACTGTACTAGGA 

CTTTAMCACTTTTGGAATTTAAAACAGCCATAAAAATTCTTGTTATACTGAAGGAGTTC 

CTGAGGCAGTGTGCCTCTCATTTTACCACCTAAAGTTGCCATAGAGGTCCAAGGAGACAC 

TGCTGATAGCAGAAAGTCTTCCAGAAAGAAATTAGGCGACCCACACCAAGCATGTATGGC 

TTTGAGTCTTACAGATGGCTTTTTAATAGTTTAGTCTCTTAACCTAAGGAAGTTTCTGAA 

GTTCCGGTCAGAGAGTCTAAAAATTCACATTTTACCTAATAAATGATAATGAGGCTATTT 

ATCTTGTCTGTCTGGATTTTTTCACTTGACATTTAATGAAATATCCCATATTACCTATAA 

TTTTTATTTGAAG 

Sequence 253 

CCCCCCGCGTCCGAGATAATGCTGTTTGCTTCCGGCCGCTGTAAATCATAGGTGAAAACC 

AGTAGCANGTGCTCACTCAGTGCCTCCCAGAAGCGGTCTGCGGGTCTCAGCTGGGCTGGG 

GGCAGTTTTCATTGGGCAAGGCTTGGGCTTAGCTTCGAAGCANGGGCTGGGAGAGGATGG 

ATGGGGGTGTGAGAGCAAAAGAAAGACCTGGCTTTGCAGTGATGGCANCCCACGTTCAAA 

TNNNAGCTCACCACTGACCNGTCGNNTGACGNGCGCCAGGNGTTAGGAGACTGNAACTGN 

TTNTGNGTNNNGNNTCCGGNCGTNCATNNNNNCTGCTCAGCATACANANCCTNTTNCTNA 

TCNTAATCCTCATACNCATGNCTGNNNACTNTACACTGTTCTACTTATCAATGACAGGTC 

AAAAGTGTTATCATNTGTGACNTAGAATGAGTGAAGTACACNCCCTCTTGGAAACTATGA 

ATGACTTAAAGAATCACCNTTTGCAAAAAATC 

Sequence 254 

GCCTTCGCCCCTGCCTCCTCTTGGCTGCGGCTGGTCATCTTCCCACCTCCACAGTGGGCC 

GAGCTGCCAAACAAGGGGTACACATGTGCCCCCTGACATAGGCCAGGTGGTCTCTGCCCT 

CTGTAGCTCCCATGAGAGAGGGCTCCTCGGACCGAAACAGGAGGCCACTGCCCTTGCGCA 

CACACCGTGGCCGGGTCCTCCTGGGCGGAGCGCTTTCCGTGTGTGGGAAAGTCAAGGGCA 

GCCCCGAGCCTCGAAGCCCAGGCTCCCAGCCCCGCGCCATGTTGCATTCCCGCCTCTACT 

CCTTGGTAGGCTGGNTGCTTTGAGTGGTTCTTTTTMTTCTTTCTGTTGGTTTCTCCTTT 

TCCTTTGCCTGGGTTTTGCTTTAACCTCTCTGTTGCAGAGATGCAGAGCACTCAGAGAGC 

CTATTTCTATCATCGCTTTCCTATTCTCCACCTAGAACCAGNTGACTGGCCGCCCGAGTG 

GNGTCTCTTGTGTGTGTGGTGCCGTCAAAGCTGTGGAAAGAAATGCTTCTGCCTAGGTTT 

CTTCGCGCCCCCCCCTTGCTTGGCTTTTCCTGCCTGCTTACACCCCCGGTTCCTGATCTG 

CCCTGGGC 

Sequence 255 

GCCCACGCGTCCGCAAGANAGCTCCTCAGATTTGTCATAGACTATATTTAAAGAAAGGCC 

ACATTTTTCTTATTTAAAATGCATTAAACAATGCANCCAATTAAAAGAACTGAGNTGGAT 

TTGTACAAAAGCAGGGACTAGGTCTGNTTTGTTCACTGCTATATTCCCAATGCCTAGAAC 

CATGTCTGGCAAACATACTGGCATGGGAAGAACATTTCCATAACCCCTGAATGTTCTGTG 

C CCCT TTCCAATTAATCCCTACCCTCAGAAGCAACCACTATTCTCATGCTTATTACATTA 

GTTTTGCCTCTTCTTGACTTTCATATAAATGAAATCATACATCTAANAAAAAANANAAAA 

AAAA 

Sequence 256 

TCGACCCCGCGTCCGATTTGATATAAATAGTTATGTTACTCATATAGAAATCTCTTCCCC 
ATTACACACATACAAACATTTATCTATGAGTGGCTTATAATTGCAAATAAGATGTAAATC 
ATGCTCATGATCATTGTCAAAATTGTGAAAGA I I I I I I I CTATACCTCTTTTAGGTTTGT 
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TTTGTTTTTGTTTTTGATTTCCAGGTGGCATTAAAGACAAGAGGGAATAATATTCATTCT 

TACTTCTACTCCCMGTCACTAGTTTGCTGAATTTAATTGAGTTAAAGAATTGTATCAGT 

CTTCTTGGAAGTCTAATACAAAACCAGTTCAACACTAGTTATTCATTCTTTGCTAATTCA 

CCAGAATTGAAGGATGGATAAAATGAGAAAGAGAAGTAGTTCTTCATATTATTAAAATAA 

AGAGTTAMTTAGACACTTTGTTGGACTCTTTGGTCTTMTMTTCCTACTCTTTTTGAG 

GTCCAAAAGTTTTGTCTTTGATAAATATAATTTTAATGGG 

Sequence 257 

AAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAATGC 

ACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCTGGGGAG 

TGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAAI I I I I AGTTATATGTGCTGTAAT 

GTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCACA 

AATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 

GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGC 

CTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGC 

ATAMCAGAGCAGTCGGCGACACCGATTTTATAAATAAACTGAGCACCTTCTrTTTAAAC 

AAACAAATGCGGGTTTATTTCTCAGATGATGTTCATCCCGTGAATGGTCCAGGGAAGGAC 

CTTTCACCTTGACTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTTCTTT 

Sequence 258 

GAGTCGACCCCGCGTCCGCTCTGGAGGMGCATAGATTAGAATCATGATTTTTATCTATT 

TTAAGAGAATAGAAGAACAGAAGGGGTTACAATCTTGCAATATTATGCAACTCTTCTGCT 

CTAATATATCAAAAACTTGATGATCCAAGATCATGCAGAACAGCTGAGAAGAAATCAAAG 

TAAACAGTGTACCTTGCAGCCAACAGATCCTGCCAATATGAGATTAGAACTCTCCATCCT 

AGCAAAAAAAAAAAAAAAAAA 

Sequence 259 

CTGGTACCTGCGAGTCGCTGCAGCAGCTGTGGCAATTGTCACCTTCATCCAGGCCCATCC 

CGCTTTGAGGGCCTAGAGAGAGTGGGCCAGAGGTTAACCCCCGATTCATCTGCCTCCCCA 

CGCTGGGCATCTGGGTGTGCCAGGGCATTCCCCCGCTGGTCAGACAGGTTTTTGGGCCAG 

GGCGGGGCTGACCAGGGTTAATTAGAGGGAACTGGCTAGGAGGAGCTGGGGAGGGGGCTG 

GGGAGAGTCCAGGCCTNCAGAGCCCCTGGGACACAGCAGGTGTGTGCTGCCATGGGCCGG 

GGCTTGAACTCTGCCAGACTCAGGCGCCAAAAACGGCGCTTGCGACCTCAGGTCCAGAAG 

CCCCGGCAGCAAGCTG 

Sequence 260 

TCGACCCCGCGTCCGAAACTCTGTCTAGTCTAAACTATTATTCTATACTTCTCATCTCTA 

TATGTTAAGGATTGATCTCCAAGATAAATTG I I 1 1 I I GTTTGNTTTTTAGGGACAGGATC 

ATGCTCTGTTGCCCAGGCTAGAGTGTAGTGGAACAATCATAACTCACTGCAGCCTCGAAC 

TCCTGGGCTTAGGTGATCCACCTGCCTTGGCCTCCTGAGTAGCTGGAATTCCAGATGCAA 

GGCACCATGCCTGGCTMCTTTTTAAAATTTTTCATAGAGATGGGGTCTTACCATCCTGC 

CCAGGCTGGCCTCGAGCTCTTCACCTCAAGCAGTCCTCCTGCCTCAGCCTCCCAAGGCAC 

TGGTATTGCAGGAGTGAGCCACCACGCCCAGCCCAAAATAATTNTTTTTTAAAGCAAGAT 

GTAGAAAAGTGATTATAATATGTTTCCATTTAGGCAAGAAAAAAAATGGAGAGGACTATA 

CCTGTACTCTCTGNACATAGGATCCACAGAAAACTTCTAATGGATGGTTATCCCTGNGGN 

GGGAAACTGGGGGACAGGGAATGATGAAGCAGGAAAATTTTTACTGGATAAACTTTAGTT 

CTGGTGGCTCTTTTTCTTCTATCATGNGNATGGTAACTT 

Sequence 261 

GTCCCGCAAAGCCTTTAAAMGAGTCCGMTTTCACTTTTCACCTTTTGTAGATGTGCAC 

GTGTAGCTGTAGAGCTCATACTTACGTTTCACATGGCATAGTTGATGGATATGTAGGTGT 

AAAGTTTATGGTAGTGGACAGGCTGAGMTGGTGTATCTGTGACAAAAAATCTGATGGAA 

GTGATATATTTGATATGAMGTGMCATTTTCTTAGTTGGGTGTTTATAA CI I I I I I I I G 

GTAMTTGTTTTTAGTTTTTATCCTTATTTTACTTATGCTTGGCMTAGATGGTCTTTTT 

TCCCCAAATCTTCTTCTGMTTCCAAGGAAACACTGTTTTAGCATTTATTTGATTACTTT 

GGTTCATTCTTTTCTCCACTCCCATTTATTTGTTTTCCATTTTGTAACTTCTATAAAGCA 
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GATAAAAATCTGGAACTTCTAGATCTGACCTTCATGCCTTGTCTTTTCTATGGTACTTAT 
TCTTTCTGTCTCCTTCTCATTTTGGATTGGGCTTATGAGAGAAATCTTGGGGTTGATCTT 
CCAGCTCACTMTTTGATATTCATTTGTGTCTCTTCAGTTACTTAGCTTGCGTGAAAACT 
1 1 1 1 1 I I I C AG CAATTGTGTACTTAATTTTCATA 
Sequence 262 

CCNCGCGTCCGTGCCCTGGGGCCCCCCTGGGCGAGCATACCCTAGGCGTCAGGCCTGGAG 

GTCTCCTCGGTGCCTCCCAGCTCTTCTGTGCTCCTCACACTCTGCTCTGNGCANATTGGC 

TGTATTTGTAGGTTACTGCC7TTTATTTCTCACATTTCTTTTTGGTGAGAGTATATCAAT 

CAATCAATCATCCTCAGACCTCTATGATAACACTGTGCCCCACACACAAGAAGCTACTCA 

ATTMTGTTTGTTTGTTGAAATGAGAGAAAATATTTGTCCTTAGTACAGAAAGAAGATGA 

AGCCAACCTCTGATAGAAGCCCACCCATAGACTAGGGTGTTGAGCTGCCTTCCAA 

Sequence 263 

NCCCCGCGTCCGGTAGTTGCAGAAGCATGTTCTTGAACTTATCAGTCCTGACCTCAGATT 

TCATCTTCTTCTGGTAGATGTAGCATACATCTCTGAGTGTTATTAGAGGACCAGNCTAGA 

GCCTCATCGTACTCCTTCAGTTACTTCATAATCATCCAGCTCTTTATATTAAATGTTTTT 

CTGTTTAAAATGGCAGCAATGTTTTTTATTTTTTGAAATGGTCCCTGACAGCAACAGATC 

TTCTCGTGTTTAATTATTGAGTCTGTGCTGATTACACAGAATTAAGGATATAGTTTCTAA 

AGTACTTCCATTTTTATATATTTTAGCATTATTCTGAAAGGCCTGGAAAAAAACTATTTT 

TTATT CGATTTGAAAGTGAAGTGACATAGGTGGGTCGCTATAGCAAGAAATTACCCTGTA 

TTTTTCCATCTCTATCATCACAGGCATCTCACAGAATTAGAAGTCGGACATTATTGATGG 

ATATATTAGTCATGAATAATTAAAATACATTAAATATAAAATGGGTCAGATACGGGCAGA 

TTT 

Sequence 264 

CNCGCGTCCGGGAGAAAAGAGTTTTATACCTAATTCTTNTGCAAGTGATTACAT AI I I I I 

ATA CCCA GGATGTTCAGCAAGATGMCTTTATTTTTTAGATGTATCTATGGNTTTTCCAC 

CCATTTTATTAATTTTTAGAMTATAAAAGTGCCTTTAAAATTTAGCTGGGTTAAGATAG 

TAAGTTTTAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGAGC 

CTAGATCGTGCCATTGTACTCCAGCCTGGGCAACAAGAGCGAAACTCAGTTTCAAAAAAA 

AAAAAAAAAAAAAAAAAAAA 

Sequence 265 

GATCCTCTAAGTNCCCAATGATCNGAGAAGAAATATGAAAGGGAATTTTAAAATATTTTG 

AACTGAATGAAGATGAAAATGCCACAGATAAAACTTTGCATGGGGNAGCTACATTAGTTA 

GCTTAGAGGGAAATTTATATTTTTAAATTCTTATATTAGGAAATAAGTCTTAGATAAATA 

ATGTCAGCTTCCACCTAAGAAGTTCAAAATAAACCCACAGTAAGCNGAAGAAAGGAAATA 

ATAAAGCTGAGAAAAAATCAATAAAGTTTAAAAGAGAAAATTAATAGATATCAATGGAAC 

AMAG7TGCCTTTTTAAAAATATCAACAAAATTGATAGACCTTTAGCCAGAATGATCAAA 

AAAAAAAAAAAAAAAA 

Sequence 266 

CCCGCGTCCGCCTGAATTTCAGAGTCNGTTGAGAAGGTAGGGAGCAGGGATCTCTCAGAC 

ACCMCATGTTCAGCTTTATTATAGATAAGGCTGAATTGATTTCCTGGAATNGACCTTAC 

TTCCCCACCTCCAAACTGCAGGCCTTCCCCTTGCAGGTTCAGAACATGTTAAGGTAAGTT 

CACTGCAGAGATAAACAGGGTGAGGACAGGCCAGTCCTAAAAAAAAAAAAAGAAAAAAAA 

A 

Sequence 267 

CCGTTCTTTTTCCNAAACTAAAGAATGCATAGGACATAAGTTAAAAGTTCATACATAACC 
TGGCTTCAAATCCAGTTCTACCACTACCTGAAAACATCAGTTTATTTCTCATCAATGGGT 
TGTTATAAAGTACCTAGCATAGGGTATTGCTTAAATGTTAATACTCCCAATCCTGACACT 
AATGTTTCAGGGAAGAGTGAAAGAAATCACATTAACTCCACATTATTGAACATCTTCTGT 
GTCAGGCTGMTACATCTTTGTATCCATTGTCTTCTGGTTGTTTAAAGACTAGTGTAGAA 
GCCTGACATGTAAATCGGTGATTATATAAGATAGTACTGTCTTGTAATGTGTTGTGCTAG 
AGAGGTTAAGTATTAATTATTATGGGAGCTCAGGAGAGGCACCCACTGGCTGCTGGGAGA 
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AGAGCMAAACCTATATATAAAGCAATTTTTAAGATTTTGGTTCTATAAGGGTAGAATGT 
TCGATCCATTAGTGTATTCCCAAAGTCCTATACACATGTCAAGATCGGGAAAGCTCACAC 
ACACAATAATGCCCAAG 
Sequence 268 

CGTCCGGCCCCATGCCCAGCCAAGTTATGTTGTTTGGTCAATTTTAGATAATTAAAAAAA 

AAAATCCTATGTAGTAACATCCTACTAGGAAAGAAGATAAACTCAANTATCCACTGAGTG 

GTGACCfGAAACCAAAGAGACATGGAGGGGCAGCCAGTCAAAAGCCACTGCACTCCAGCC 

TGGGCAACAGAGCAGGACTGTGTCTGGAAAAAAAAAAAATCTAGAAACATACTCAAATGT 

AAACATAAGAACTTAGCATATGGTAAAGTTGTGACTGTGAGTCAGTGCAGAAAAGAATGA 

ATCAATCAGTAAATGGTCCTGGGAAAACCGGCTATTTATTTTAAAATAATAACAAAATAT 

GCTGTATATCATCAATGGATTAAAGATTTATATACCTTCAGCATATGTCCTTAAATGTTC 

TAGMTAAAATATATATGAATTCMTGTAATTTTTGATGTGAGGGAAGGCCTTTTAGACA 

TGACACAAATGTCAAAAAGCATAAAAGGAAAGATTTTGAATAAATAATAAAC 

Sequence 269 

GCGTCCGACAGATACACCATCATTATAGGTGGCCAAATCATTCAAATCGCTGTTTGCTTC 

TTCTCGGGC TCTTA TTCAAGCTCACGATGCTATTCTCATAGTGATTGCGAATCTCACTGC 

ATAAAGGTGATTTTGAAAACATCACTTTTCCTCCACACTAATGTGTGAGCTTAGCCAGAG 

TAGGAGATATATCTTACTTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 270 

CCCTTTTTTNCGGCCNTNCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAA 

AGGGCTGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCAT 

CTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGGGGAGCCGGATGTCAGAG 

GTCCTGAAATAAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCA 

TTCGGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAAGA 

TGCTGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGGCAT 

TGGGATCATTGCATTGATATTAGCACTGGCCATTGG 

Sequence 271 

ACCGCGGNGGCTTCATGCAAGCTGTGGGCATGGNCAACGATCACGAAAATCATTNTTCCT 

TTAAATAAAATACAATCCTATNNAAAGGAGTNCTTCCATGAGCAACAATCAAATACGTGC 

TTATGCTGCGATGCAAGCAGGTGAACAACTGGTTCCTNATCAANTTGACGCAGGCGATTT 

ACAAGCCCATCAAGACGAAGTCAAAGTTGAATATTGTGGATTATGTCATTCAGATATTTC 

AAGTGATTAATAATGATTGGCCGATCATCTACTTATCCTGTAGNCGCAGGTCATGAAATC 

ATTGGAACCATCACTGCTCTTTGGTTCATGAAGCGAAAGGACTCAAAG 

Sequence 272 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAAATTTAAGTATGAAAGAAAGAAAAAACAAAANGTTGAAAAAGAAAAACAATCTTTCAC 

AAACAATAGAGAAATTCGTTTATCTTTTGGAATCAATTTAAATGATATAAAAATCAAAGC 

AAAAAAAGCCAAAGAATTTTTATTAGATAACCGACAGAGTAAAAGTGGCTCTTCGTCTTA 

GAGGGCGTGAAAATACAAGACCTGAACAAGGTAAATTAATTTTAAATTCTTTTTTTGATG 

MGNAAAATCGATTGCAAAATTAMGTAAAGMNNGCAATCAGTTTGGTAATTTTTTAAC 

TCTTCATATTGAACGTGATAAGAAAAAATTACCCAAATTTACTTCTTCAAAACAAATAAA 

GGAATTAATTGATTTTGAAAAAACTATTNAAGGAAGGAAGAAACTAATGCCTAANGCCAA 

AAACAAAATCGCACTTTAAAAAAA 

Sequence 273 

CCCTTAGCGTGGTCNCGGCCTTTGTACGAAATTATGACTGTTTTAGCTNCAGGAACAGAT 

TTAAAAGTMTTGAAGACGTTTTAAMTCTGTTTTTGATGCTAAGAACATCGAAAAAATT 

G AAAA ACTTGAAAGAACAGAGTTAGCATACGAAATTAAAAAACACAAACAAGGAATTTTT 

GTTTTAGCTAACTTAAAATCTGAGGAMGTTTAATCGAAGAATTTGTCAGAAGAGTAAAT 

ATTCTCMAAMCAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCTGAAAGAGGAATG 

CACAAMCTTTCAGACCTAGAAAAAATGATAAACACAAATTTTTCTCTTCTAAAAAACCA 

ACANCTTCAACAGAAGAAGGTMAAGTTTTCAAAAACCATTTGTCAAAAAACCTTTTGTT 
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AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGANCAAGACAAAGTGCTAAGAAAA 
CCAAAAACTGTAAAAACCAGCAAAAGATCCTAAAGTAGCTCACACAGCAAAAAAAAAAAA 
NAAAAAGTCCTGC 
Sequence 274 

cccttagcgtggtcgcggccgaggtacnaaattatgactgttttagctccaggaacagat 

ttaaaagtaattgmgacgttttaamtctgtttttgatgctaagaacatcgaaaaaatt 

gaaaaacttgamgaacagagttagcatacnamttaaaaaacacaaacaaggaattttt 

gttttagctaacttaaaatctgaggaaagtttaatcgaagaatttgtcagaagagtaaat 

attctcamamcaagttttaagatttttagttattaatctagattctgaaagaggaatg 

cacaaaacmcagacctagaaamatgatmacacaaatttttctcttctaaaaaacca 

acmcttcaacagmgaaggtamagttttcaamaccatttgtcaamaaccttttgtt 

aaaaaatcagaagaaacagatt'cttcaaaacaaaatgaacaaggacaaagtgctaagaaa 

acccaaaaactgtaaaaccagcaaaagattcta 

Sequence 275 

CCCTTAGCGTGGTCGCGGCCGAGGTANTTNCCTGANCAGTCGAAGTGGATGCCCAGACCA 

ATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAAT 

GGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATT 

TTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGA 

TCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCT 

ACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCAC 

AAGTGTTACTGGCTTCCCATAAACACAAGCCCTTTCCT 

Sequence 276 

CCCTTTCGAGCGNCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

AATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTT 

AGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACC 

TACTGCAGCTTTTTTAMAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGG 

AGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAAC 

TMTATGTTGAAAAAAGGCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGG 

AACAATGAAAGCGATTATTAACTTAATCGAATCTCAAGGTGCTGGTTGNTCATAAAGTAA 

TCI I I I I ACTTGAATTANGA I I I 1 1 AAACCGG AATTG AAAAACTTAAAA 

Sequence 277 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTTCTAANGTTAAATCCTGAGGTAAGTCAACC 

MMCAGGATGTGMTATCCTMTMTMTTCTAAAGTCTTATCTTTTAATGTTGCTCTG 

TMCCAACACCTTTGATTTCTAATTCTTTAGAAAATCCTTTANAAACTCCGGTTAACATA 

CCTTGTAATGATGAATTTGTAGTTCCGNGTAATTGNTTAATAI I I I 1 1 1 CTTCTGATGTT 

CTTTTAGTAAGTAGAGTGTTTTCAAC I I I I I C AATTG AAATAAATGATG AAA ATTCTCTT 

GATAATGTTCCTAATTTACCTTrrATAGNTACTNAAGMTTGTTAATGTTTACTTCAACA 

CCTTCTGNTATTTGTAAAACACGATTTCCGACACGAGACAATATTAATTATCAAATGTAA 

GCAATGGATTTCACCACCTACATTTTTCCCTTTCTTGCNTTGTATAATTAGGTTNAATAA 

ATCCTTT 

Sequence 278 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGATAGTNNTCTCAGTAAAAGGTCTATTATCT 

AACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCA 

CAGCTTGAAMGCMTTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGC 

AGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGANCCAACAATAAAAATAGCACT 

AGCAATAANAAGAATGCCATCCCATGGAGCACACCATAAT 

Sequence 279 

CCGAGGTACTAAACTCTNTTTAATGNNCTAACGTCAT AI I I I I I AAG I I I I I CAATTCCG 
TTTAAAMTCCTMTTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCG 
ATTAAGTTAATAATCGCTTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCT 
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ACTTTTTGGCCN I I I I I CAACATATTAAGTTNGGATTTCTAAGAGTTGATTTACCATATT 
CTAMTCATACTCAAAACTMTMCGTCTCCTGGGTMTTTTTTAGGGTTTNCTTACCAT . 
AAATAAAAAGGGGT I IT I I I AAAAAAAAGCTGGCAGTAAGGGTGTCCCAAACAAAGAAAA 
CCCTCTTGCGGTCTTGGGACCTATAATAACAATCTGCATCTTTAAGCTAAACTCAAGCCA 
TTTGGGATTMTTGGTGTAMTTTMGCACTTTCTCCATTTGGCTAAAAAGTGGGGNGAA 
MTATCTTTAMMTACMTCCCTTCAATTGGGGAAMATCTTTTAACATCCTCTGGATG 
GNATTTCTATTAAAAATTGGATTTGGA 
Sequence 280 

TATTrGGTGMGATCAGCGTTATCAGCATTTTCTACGATTAAACGCTGGCCATGCTTTGA 

CTGATGAAATCCGCCAAGCTATTCAGCAGTTGGCGGATTGGGTCAGAGAAAGTCTCAAGT 

CGAAATTGAGTTAGTCGAAGTCTGCTTTCAATACGATGCGATAACGTGCTTGACCTGAAT 

GAAGTCGCTCAATGGCATCGTTGAGCTGTGACATAGGATAGAGTTCAATTTGCGGGGCGA 

TATrTTTACGTGCTGCAAATTTGAAGAAGTTGACGAAGTGCTAAAGGAGAACCCGTCGGT 

GAACCTGTTACTGATTTGGCACCATCAATCAAAGCACCCGACTGAAACTGGGAATAGGTT 

CTAAAAGTCAGAGCTAGAAAAATG 

Sequence 281 

NGGTACCTCCACCGCGGTGGCGGCCGAGGTACANCCTCTCGGCCCGGCTAAACATCATCG 

TCTTGGTAGGTCATTACCNTACCMCTMCTMTGNTCCGCACCCCCATTTTTAAGTGAA 

GCTGNGAAGCTCCTTTCTATTACTCATCATGCGATAMTAACTATATCCGGTATTAAGCT 

ATTGATTCCAATAGTTATCCCAGNCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 282 

CGANAAGCANGATTTTNAATTNTTGCAGCCCGGGGGATCCGNGGAGNGGGGAGAGCCCAC 

CGCGGGGGAGCGCCAACACGACNNAGAGCGAGNCGNAATACGCGCGCACACNGACCGCNG 

ANNAACAAACGNNGAGACGGGGAAAACCCNGGCGNCCNCAACAAAAACGCCCGNGCAGCA 

CAGNCCCCAAAAGCCAGCNAGGCGGAANANCGAACAGGCCCGCACCNAGCGNCCGNCCCA 

ACAGNAGCACAAGCCCCGAANGGCGAAAAGGG 

Sequence 283 

AAACCCCCTNAAAAAGTNCCAAGGGGCCGCCAANNCGACTANAGCGAGNCGCATTACGCG 

CGCNCACTGTTTGTCGTNNTTAACGCCGCGACCGGGNNGAACCNGGGCGNNACCCAACAG 

GAAGCGCCGCGCANCACAACCCCCNACGCCAGCAGGCCGAnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnAGNGGAGGAAACAGCGCAGCAGAGACCGNNACACANGCCAAC 

GCCCNAGCNCCCGCNCNNNACGCNAGCAACCCCACCNAACANCGNCCACGNACGACNGCN 

NGCCCCCGCNCAANCNCNAAAACGGGGGCCCCCNANNANGGGGACCCGANAAANCGCNAN 

NACGGCACCCNCCGACCCCAAAAAACNNGAAAAAGNGAGAAGGGAGCACGNAAGNGGGGC 

CAANCGCCCCNGAAAAGAACNGGCGNCCCGCCCNNANGAACGAANGGGANGACCCACGNC 

CCNAAAAAANAGGGGGACACCCGAGNNACCCAAACGGGGNAACAACACGCAAACCCNAAG 

GCCGGCCGAAACGNGAANGGAACCAACAAGGGGACACANGCCCGAAAACCGGNCCAAAGG 

GGGAAAAAAAAAAGAAGCCGNAANNCAAACAAAAAANCNNAACGGCGAAAGNNNAAACAA 

AANAAGAAACCNCNAACAACNCAAAGGNGNCACNNNNNCNGGGAAAAAGGGGCCCC 

Sequence 284 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATNGGAAACAATAGCTAATAC 

CGGATATAGNTATTTATCGCATGATGAGNAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGT 

TNAGCCCGGGCCCGAGAGGCTGTACCTCGGCCCGCCACCGCGGTGnnnnnnnnnnnnnnn 

nnnnnnnnnnAGNCGTATTACGCCGCGCTCACTGGCCGGCGnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnACTTMTCCGCCTTGCAGCACAATCCCCCTTTTCGCCAG 

CTGGGCGTAATAGCGAAAGAGGCCCCGCACCCGAAC 

Sequence 285 

CGTCCGTNAGGCCTGTNCTTACGGCTGGGTTTAGAAACCAGCCCATTCAGAAAGACTGAA 
TCAGAACATGGATNAAGTGAACTNATTCTAAGATGACTCGNNTATCCATGTNGATTAATC 
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TNCTGGNTCATAATAGGCCTCTTCCCTTTGATTGAAGGGTCACGTNTAAGTATANAAAAC 

ATAAAACTGTAAGGTAGAGGMGCGAAGGATAGCTTNGTATTAATGTTGCGTTAAAGCTT 

CAGAGACAAGAACAAGAACACTCCTCCCACGTGACAGCATTTGAATAGGAGGCGGNGGGT 

GCNGCAGCCTGGGCAGCTTCAGTCCCGATTTACAATAAAGTACCTTGNGNGTNATTAGTT 

CTTAAATGTTTATTTAGAAATGGCATTGATGTT 

Sequence 286 

GCGTCCGGTCACCGCACTGAACTTCGGGACCAAAAGCTTCCATGCCGCGGNCCCCAGACA 

AAGGCAGCTTCCCGCTGGACCACTTCGGTGAATGTAAAAGCTTTAAAGAAAAATTCATGA 

GGTGTCTCCGTGACAAGAACTATGAAAATGCTTTGTGCAGAAATGAATCTAAAGAAGTAT 

TTAATGTGCAGGATGCAAAGGCAGCTGATGGCACCGGAGCCGCTAGAGAAGCTCGGCTTT 

AGAGACCTAATGGAAGGGAAGCCAGAGGCAAAGGATGAATGTTGAGAAGGGAGCCACAGG 

ACCTTGTCCCCAGCCTGGAGCAGAGCTGAGCCCTTCTGCCCACAGNGCAGGGGGACCTGA 

CACTCAGCCCGTGCTGGCCCGTGCAGGGGCTCTCCCTGGG 

Sequence 287 

CGTCCGGTGCACTGCAACTTNTATATNTAACCCTAAACTCCAATAAAACAAATTCCAGGG 

AAAACCAAGGGTGTAATGGGATGTGCGTGTTTATCAGGAGTGTGCTCTCACGTGGATGCT 

GAATGATGGAGGACAGCGGACTGCATAAGCCAGAAACCTGTACGGGTGCTGGCTGTGGAA 

AGACGTGTCTGTCTCTATCTATGTACAATAGTTNATTCTGTCAGGCTGAAAAAGTATGGT 

CTTTAGGACCTTGCCCCTCTAACTATAGAAACTTAAACAGTGTACTGCTATTAGATATAT 

CTGATATTAATAGAACATGCCAAGTGCAGGTCCCAAATGCGTATTTGTGAAGCACACATC 

TGAGTAAATGGCTTAGATGGAAAGCAAGTCATCATGAGTAAAAATTAAGCCTCAAACCTG 

CCGGTGCTCCTCACCTCTTTGTCACCAGGTAAAGGTCACACTGTGTGTTGCTTTTGNTGT 

CTTCTCTTCCTAACCTAG 

Sequence 288 

CNCGCGTCCGGGGCTTGTTTACTATGGCCGATGATCTGGAGCAGCAGCCTCAAGGCTGGC 

TGAGTAGCTGGCTGCCCACGTGGCGCCCCACTTCCATGTCTCAGCTGAAGAATGNGGAAG 

CTAGGATCCTCCAGTGTCTCCAGAATAAGTTCCTGGCCAGATATGTGTCCCTCCCAAACC 

AGAATAAAGAT CTGG ACGGTGACTGTGAGCCCCGAGCAAAAGGACCGCACCCCCCTGGTG 

ATGGTGCACGGTTTTGGGGGCGGCGTGGGCCTCTGGATCCTCAACATGGACTCACTGAGT 

GCCCGCCGCACACTGCACACCTTCGATCTGCTTGGNTTTGGGCGAAGCTCAAGGCCAGCA 

TTCCCGAGGGACCCGGAGGGGGCTGAGGATGAGTTTG 

Sequence 289 

NGGAAAGCCGGCATAAGTGACATNGTTTGGGCAGTTGCCNGCTGGACTGAAGGGCNAACC 

CANACCACCTTAAGCCATAAAAGCCCGTGACACTGCTANCNAAGGTGCCTTGCCCACCGC 

TTTGCCACCNGTCCCGGAAATGNAAAAAAAGTCGCGTGCCNTAAAAAAGCTGCCGGAAGG 

NCCTGGGTGNACNTTTGGGCCACCCCCACCCCGCTGGCAAGGNCTTGAATTGNGTNACNC 

CAAAAGACGCCCANGCCGGACCTTGGNAAANNATTGTTNTTTNGGGANAAAAAAAAAATG 

GANCCCGNTGGGGGAGGCCCTTGGGGGCATTGNGNAAGCCCCCGGAGGGTTCCCCGNTGT 

TGGCNGGGGTCAAATTCAAAGGCCAGTGGTNGGCCACCCCGGGGAACCTGGNNGCTGTTG 

CAAGNACCNNGGTGGGGAACCGTTTCAAGGAATACCCCAACCAACCCANGTAGCCACCTT 

AAGGTAATTTGGCCCACCTTGCCCACCANAANGNGCCATTGGGGAAGAAACCACCAAAAA 

CGTTCCCCCCGGGT 

Sequence 290 

CCCCGCGCTCAGTATGACTCTTTAGTCCCAGTTTTCATGGGTAGTCTCTAAATTCTTTAC 

CTTTATGTGATTGTGAGTTGGGAGGTGGTGGGCATCATCTTAGTCCATTTACCTTTTTCA 

GTTTTGTAATT ATCG TCTTCATCTACTACCTTTATATAATATAAAGGGAAGGGGTCTTCC 

TTTACAAATAGTTTTATCATCCTTCTCTTTTTGATGTCTATATCTTCTA I II 1 1 I GAGGA 

GAATATATGTTGTATAGACCTACACGTGGGTGGAAGAAGAGTCATGTATGTGTGATTGTG 

TGAGATCCAGMTGTTGGAACTTTTAATTTCTTATTTTGTACTTATAATTACCTGCTGGA 

ATACCTGGTTGCATTCTGTATATTGTACCCTCATTTAAAAGTTCATGGAGGCAAAATAAC 

TCTGTTGCACATAAGGCCGGGGCTTATGCATGTCCTATCGGATGTGGGCTCAGATCACGG 
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AAGACCCGAA 
Sequence 291 

TCTTGCAGACTCAAGCTCCGCGCGCAGCCGCTCCTGGTGCGGGCCCACAGCAGCCTGGGC 

CCCGGNTCGGCCCCGGAGCCCCCTGGCCTGCGACGACTGTTCCCTTCGATCGGCCAAATC 

CTCCTTCAGCCTCCTGGCGCCCATCCGCAGCAAGGACGTTCGCAGCAGGAGTTACCTGGA 

GGGCAGCCTNCTGGCCAGNGGGGCCCTGCTGGGGGCAGACGAGCTGGCCCGCTACTTCCC 

AGACCGGTACGTGGCGCTCTTCGNGGCCACCTGGAACATGCAGGGCCAGAA 

Sequence 292 

NGGCCCGCCCGGGCAGGTACTAAACTTTMTTAATGAGCTAACGTCATATTTTTTAAGTT 
: TTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCAC 
CTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGTTCCGCCCAGNTAGCTAAAACATC 
ATCAATAATTGCTAC I I I I IGGGCI I I I I I CAACATATTAAGTTTGGATTTCTAGAGTTG 
ATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTCCTGGTAAI I I I I IAGGTT 
TTCTTACCATMTAAMGGTTTTTTTAAMAAGCCTGCAGTAAGGTGGTCCCAAACAAGA 
AAACCCTCTTTGCGTCTGGGACCTATAATAACCATCTGCATCTTTAGCTAACTCAAGCCC 
ATTTGATTAATTGGNGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGGTGAAATAATCT 
TTAAAATACAATCCCTTCAMTTGGGGAAATCTTTTAAACATCCCCCGGCGGTACCTCGG 
GCCCGCTTCTAGAACTAGTGGGATCCCCCGGGCCTG 
Sequence 293 

GCTCCeCGCGGTGGCGGCCGCCCGGGCTGGTACGCGGGGAAATGCAAAAAATCAAATCAA 

TTTAATACGAATACATCATGAGATGTTAAAGATTTCCCAATTGAAGGGATTGNNTTTAAA 

GATATTTCACCACTTTTAGCANATGGAGAAGTGCTAAATTACACAATTAATCAAGTGGCT 

GAGTTAGCTAAAGATGCAGATGTTATTATAGGGCCAGACGCCAAGAGGTTTCTTGTTTGG 

GACACCTACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATT 

ACCAGGAGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAAT 

CCAAACTAATATGTTGAAAAAAGGCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTAC 

TGGCGGACAATGAAAG 

Sequence 294 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCTAATTG 
ATCCTGTTCACATTCAGTG AAATGG CATTGCATATTTATATGTTGCTTACAGCTTATTG A 
TTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATG 
CACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGTTTCTTTT 
GCACATGCTACATTGTGCTTTAGACCGGGAGATAATACAGGNGCCTTACCTTACAAATTN 
ATNTTTNTGGCAACNCNMTTNT^r^NGAMTTTTN^^TTAATTTNAAAACCCCAAACCAA 
TTTTCCNNCNCAAAAAI I I I I I I I I I GGG AAAAATTAANTTCTTTAAANN NAACCCCCCN 
AAAAATTATNGGNGNNAAAAGGGNGCCCCNTTTTGGGCCC \ I 1 1 I 1 1 I I 1 1 1 CNCGGGNG 
GGGNAAAAMTTTNAMMAANTTTTTTTTGGGNNCCCCGGGAGAAAAANNTCCCCTNTT 
1 1 I I I I I ICNGGGGTTTTAAAAANGGGGGGNNAAAAATNTTTNNTGGCCCCCCCCNTTTT 

Sequence 295 

TATAGGGCGAATTGGAGCTCCCCGCCCGNGGTCCCAAATGGAAGTGTGAAAACCANGGCC 

CATCCCCCNNTTTTNTAGAGGGGTGGTAAAAAATAMCCCANANATCAAGGGGAGAAAGG 

AAMGGATGAAAGGACAAACTGCCAAAAAMTTTNCCCAAAGTGGCGACTTTTTTAANTN 

TGGGAGCCAGAATTCTGAGGGCTTTGCATTGTCTTTGCAATTCNCTCAAGGAGCCTGAAA 

TTGMAMAMTGCCMCAAGGCCAAATNMCTACTTTTTAGGAGGGGGTTTTGGAGGTC 

TTGGGAAGCCTCATTCCCNTTCAACCNNTCNAATTCTGGGAATGGGGGAAATGGAMGAA 

TAGAAGATGTTGGGTGCCCACTAGGCTACTGNTTGAAAAGGGGAAGCTTGAAAANTTNCT 

NCACCCAAGGTTGGGTATTCAAAAATATTGTAATGGACTGGGTATTGGCAAAAAGG 

Sequence 296 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGTGGGTCCCTTTTCAGAGGT 
TGGGCCTTCTAGACCTCACCTGTTCTCACTNCCCTGGTTTAAATTCAACCCCAAGCCATG 
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GCCAATGGCCAAATAATAGAAATTGGTTCCCTACCCAGCTGGACCAGGGGAGGGAGGTCT 
TGTGCAGTTTCTTGACCACTTTGTTGGTTGGACCATNGGCTTAAATACCAATGGGGTATT 
CGGCTTGAGACCTAAAGTTTGTMGAAAATTNAACCAAAATGGTGCCTGCTTGGGTTAAA 
AATGGGCTACCACCTCAATCTGGACTTCMTTC^ 

TTGGTATTCTTTGGCCTAAAGGTGGCGGTAGTCCCMCCTCTTTGGGTANTTACCCCTTC 
CTAAATAGGTCAATACCTAGGTAGGTCAATACCTCCCTGGGTGGTAAGGNGGTATTTCTT 
CTTA AA AA AG CCTTTTA AA A 
Sequence 297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACT 

CCTACGGGAGGCAGCAGTAAGGNNTTTTCCACAATGAGCGAAAGCTTGATGGAGCGACAC 

AGCGTGCAGGATGAAGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAA 

AATAANNAAAAAAANGGTACCT 

Sequence 298 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACGCGGNGAAATGCA 

MAAMTCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGG 

GGTTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAAT 

TAATCAAATGGCTGAGTTAGCTAAAGATGCANATGTTATTATAGGTCCAGACGCAAGAGG 

TTTCTTGTTTGGGACACCTACTGCAGCTNTTTTM 

ACCTAAAAAATTACNAGGAGACGTTATTAGTTTNGAGTATGATTTAGAATATGGTAAATC 

AACTCTAGAAATCCAAACTAATATTTTGAAAAAAG 

Sequence 299 

CCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCA 

GTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGAAG 

TTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTACCT 

Sequence 300 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCTAGGACAATCAGGAAG 
TMTCTTAAAAAAATAATTGMGATGTTAAMMTACGTTAAAAATAAAAAACTTGTTTT 
; AAACATAGATGCAGTAGAAATTGAAAAACCAGATTTAGATGCAAAATTATTAGCTGAATC 
MTTGCMTTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTTGCAATTCG 
TTTAGCACAAAAAGCCGGAGCTAAAGGTATTMAACTAMGTTAGTGGTCGTTTAAATGG 
TGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACACACACTTAG 
ACMGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTTGGAGTTAA 
AGTTTGAGTTTCATTAGGCGAAGTATTT 
Sequence 301 

CCGGCCAGGTACGCGGGGAAATGCAAAAAAATCAAATCANGNTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCAATTGMGGGATTGNATTTAAAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATNACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCMGAGGTTTCTTGTTTGGGACACCTACTGCAGC I I I I I IA A 

MAAACCTTTTATTATGGTMGAAAACCTAAMAATTACCAGGANACGTTATTAGTTTNG 

AGTATGATTTACAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCMMAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TTMCTTAATCGAATCTCAAGGTGCTGNTGTTCATAMGNAATCTTTTTACTTGAATTAG 

GAI I I I I AAACGGNATTNAAAAACTTAAAAAAATATGACCGTTAGCTCATTAATTAAAAG 

TTTAGTACCTCGGCCCGCTCTAG 

Sequence 302 

AGGTACTTTGATATCTNCGCCCTCTCGTGTGTTCCTTGTGGNGNTAACCAGAGGCAAGAT 

GCCCGAGGAACTTCATGTGTATGTCTACCAGGATTTCAGATGATCTCTAATAATGGAGGA 

CCTGCTATTATTTGTAAAAAGTGCCCAGAAAACATGAAAGGTGTTACAGAAGATGGCTGG 

AACT GCAT TTCTTGCCCTAGTGACTTAACTGCCGAAGGAAAATGTCACTGTCCCATTGGC 

CATATTTTAGTGGAAAGAGACATTNATGGAACATTGNTGNCTCAAGCAACTNGNGAGCTC 
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TGNGATGGAAATGAAAACTCTTTTATGGTAGTAAATGCTTTAGGAGACAGGNGCGTNCGA 

TGTGAGCCAACATTTGNTAATACCAGCAGGTCCTGTGCATGTTNCGAACCTAACATTTTA 

ACAGGGGGATTATGTTTCAGNAGCACAGGGAATTTTTCCTTGTACGTANAATTTCACCTG 

CACGTTATGGAGAAGTTTGGCAT 

Sequence 303 

CCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCMTTGMGGGATTGTATTTMAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTACTGCAGCI I I II I A A 

AAAMCCTTTTATTATGGTAAGAAMCCTAAAAAATTACCAGGAGACGTTATTAGTTTTG 

AGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTMTATGTTGAAAAAAG 

GCCMAMGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TTAACTTAATCGMTCTCAAGGTGCTGTTGTTCATAAAGTMTCTTTTTACTTGAATTAG 

GATTTTTAAACGGAATTGAAMACTTAAAMATATGACGTTAGCTCATTAATTAAAGTTT 

AGTACCTCGGCCGCTCTA 

Sequence 304 

GCGGTGGCGGCCGAGGTACCTTNTCCGAATGCACCTTNAAAGCGGGTATTAGCCTATACA 

GGCTGTTTTAGTCGAATGCAGACCATCAAGGAAATTCNNGAATATCTATCTCAAAGACTG 

CGCATTAAAGAGGAAGATATGCGCCTGNGGCTANTCCANAAGTGGAGAANTACCTTACTC 

TTTCTGGGNTGATGAGGAATCATAAATCTGGAATATTTNGAAAATCCAGGATGAACAACA 

C 

Sequence 305 

GCNGNCGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTA 

MGATTTCCCAATTGNNGGGATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAG 

AAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTNAGCTAAAGATGCAGATGTTATT 

ATAGGTCCAGACGCAANGAGGTTTCTTGTTTGGGACACCTACTGCAGCI I I I I I AAAAAA 

ACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAGACGTTATTAGTTTTGAGTA 

TGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAGGCCA 

AAAAGTAGCAATTATTGATGATGTTTTAAGCTACTGGCGGAACAATGAAAGCCGATTATT 

AACTTAATCGAATCTCAAGGNGCTGGTGNTCATAAAGTAATCI I I I 1 1 ACTTGGAATTAG 

GGATTTTTNAACCGGAAATTGGAAAAA 

Sequence 306 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAATTA 

TGAAATTATTGCAGAAAGAAGATTCACTCTCACCTGATGAATAAGTGTTCATAGGTNAAG 

GCTACAAAATACTAATTTGTTATTATTTTTAATAATAA I I I I I GTTTTGCTGAGAAAGTG 

GATTTACCACTTTTTTATTTTTTMTCCAAGGAGGAAAAATTATTTCCAAACCAAATCCT 

AAMATTTTTCACGTTCTAAACCAGTTCAAGAACATTGAGTAAACAGAAATATTCCATTT 

GTCAAAGTTTTTCTTATCGGCTCAGATAATGAAAAAATTGGGATAATTGAAACAAGAGAA 

GCTATTGAAATGGCAAAAGAACAAAA 

Sequence 307 

CGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGCAAGCGGACGTGAGC 

GATAATGGCGGATATGGAGGATCTCTTCGGGAGCGACGCCGACNGCGAAGCTGAGCGTAA 

AGATTCTGATTCTGGATCTGACTCAGATTCTGATCAAGAGAATGCTGCCTCTGGCAGTAA 

TGCCTCTGGAAGTGAAAGTGATCAGGATGAAAGAGGTGATTCAGGACAACCAAGTAATAA 

GGAACTGTTTGGAGATGACAGTGAGGACGAGGGAGCTTCACATCATAGTGGTAGTGATM 

TCACTCTGAAAGATCAGACAATAGATCAGAAGCTTCTGAGCGTTCTGACCATGAGGACAA 

TGACCCCTCAGATGTAGATCAGCACAGTGGATCAGAAGCCCCTA 

Sequence 308 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATGTTATTAATGTGACTGACA 

AGTMTTAGAAAACTGGAAATTAAATTTTACAAAGATTTTTAAAATCGCTNCAATTAAAA 

AMTTCAAGATGGTTACATTATGAATATGAATGAAATGTCATTAGCGACTTCGTTAAATG 
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TAT ATGTAATTCTATA TTTTCCCCAAMCCCACATTTTATGAAGMTATTTATTTATTTA 

TTTATTTTTGTTTTTTGAGATGGAGTCTCGCTCTGTTGCCAGACTGGAGTGCAATGGTGC 

GATCTCCGCTCACTGCAACCTCCACCTCCTGGGTTCAAACGATTCTCCTGCCTCAGCCTC 

CCGAGNAGCTGGGACTACAGGCACCGNCACCACGCCCGGCTAATT 

Sequence 309 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGACCCTCCTTGATGGTTTACTTTGCAAGCTA 

TGGTGACCTCCGCAAGTTGTGTCTGGGCCCATCCAGGGCTCTGACTAATTGTATTCAAAT 

C AAGGCA GGAGCGGGCCAGCTGGCGTTGACTTAACCAAGCCATTTTATAAGCCTCCCGAT 

CATTTTTAAGCCACTCTAAGTCGTGTAGTAGGATCTGGTCAGAGTTATGTATACTCTGAT 

GGGCATGTGCTGTGTCGTCTAAAATGTCCAGAAGTTCTGAAACACTTTTAGATCTTCCAG 

AATTTCTTGAGGAA GTCT GCCTAAGTAACTGATGCACATCAAGTTCATCACCGGAGGAAT 

CAAAAGMTTTCCATTTTCTATTTCTCTACAGAAAAGAAAAGGATCTTCCTTTAAGATGG 

AAATATTATTTCTCTTC 

Sequence 310 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTTGAATTATCTA 

TTGAAAGMCTACTACATCGAGTTTTTGTTCTTTTGCCATTTCMTAG 

NAATTATCCCAATTTTTTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATAT 

TTCTGTTTA CTCM TGTTCTTGAACTGGTTTAGAACGTGAAAAATTTTTAGGA 

TGG AAATAA I I I I I CCTCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAG 

CAAAACAMAATTATTATTAAAAATAATAACAAATTAGTATTTTGTAGCCTTTACCTATG 

AACACTTATTCATCAGGTGAGAGTGAATCTTCTTTCTGCAATMTTTCTAATTGCCCCGN 

GTACCTGCCCGGGCGGC 

Sequence 31 1 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAAGGTACCTGACTGTGGC 

TCANATCTGCGTCGCAGCAGCGAGAGAAGAAATCACTCCATATCCGATGAGAGGAAGGGT 

GGCACAGANATGGTGTCTACMTTAGAGACATTTCTGACTCCACCTTAGCCTAAGCAAAC 

TTTATATACTGAGTMCATTTGMGGTTGTCT^ 

TTAAACTACAGT 

Sequence 312 

CCAAMNGGGNCCTGGGGCGTGGTCACGGCCNAGGTACAAAATTATGACTGTTTTAGCTC 

CAGGAACAGATTTAAMGTMTTGMGACGTTTTMAATCTGTTTTTGATGG 

TCGAAAA AAATT G AAAA ACTTGAAAGAACAGAGTTAGCATACGAAATTAANNAGCACAAA 

CAAGGAA I i I I l GTTTTAGCTMCTTAAAATCTGAGGAAAGTTTAATCG AAGAATTTGTC 

AGAAGAGTAMTATTCTCAMAMCAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCT 

GAMGAGGAATGCACAAMCTTTCAGACCTAGAAAAMTGATAMCACAAATTTTTCTCT 

TCTAAAAAACCA 

Sequence 313 

AATNGGGCCTNGCGTGGTCACGCCCAGGTACNAAATTATGACTGTTTTAGCTCCAGGAAC 

AGATTAMAGTATTGAGCNTTTTAATCTNTTTTGATCTMGACATCGAAAATTGAAAACT 

TGAAAGACAGAGTTAGCATACNMTTTAMNGCACAACAGGATTTTTTGTTTTACTAACT 

TAAA TCT GGGGA AGTTTAATCNAAGATTTGTCAGAAGAGTAAATATTCTCAAAAMCAAG 

T1TTMGATTTTTAGTTATTMTCTAGATTCTGAMGAGGATTGCNCMAACTTTCAGAC 

CTAGAAAMTTGATAMCACAAATTTTTCTNTTTTAAAAAA 

Sequence 314 

NGGGCCTNGGNGTGGNNACGANCCAGGTACTTTTACCAAAGAATCTACTAGAACTCTCTG 

CTATTCAAMCMAGAGCTCATACTTGTTGGAGTAGGGAAAAAATTAGAAATTTGACCAA 

MGATAGATTCAATCAACTACAAAGTCMTTCCAAGATGCTGNTAACATCGAAACTCTTG 

MAAAGAACTATTANAATCTGGAGTTGAACTtTAATGGATCATATACCTGNTTTGCTTGA 

TCMGCGGATTGNTCAGCTTMTATTAAAGMGATGGNATCTATTTAGATCTTACTTTAG 

GACGNGGNGGNCATTCGAGNCAAATTTTAAAAAAACTTACTA 

Sequence 315 
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CCCTTTCGA GCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT 

TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGAAAAGCTTT 

CTCATATCCAAACGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 

GGTGMGATACTCCTAAATCMGTAATACACCATCTACTTTGTTGATTTGAAGTTTTTT^ 

AGTTCTTGATCAAMTCTTTAAAATCAGATCAAATMATTCMTATTTGAAGAMTTTTT 

MGAGTTTTTCTTTTGTTTGTTCAATTGCTTGTTTGTC 

Sequence 316 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAQCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACCTATCTATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 317 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGMTTATGAACTTAAAAAGCAMTTMGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTTTGCTTCTTGCCAAACAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTAATAGTAGAGAAGGAAGGTTTTTAGCATTAGATCAAA 

TTGTTAAGCTTTGAMTGMAGTGAAAMGTTMMACAMGMGMGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 318 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 

CAGGATTCCTCATTGGTGAGTTCAACCATACATTTTGGGTAACTCCTGAAGACATCTGCA 

MTTGTGAGTTAGTTGGTGGGGTCCATTMCATTTGCATATGTTATTCTTTCTACTGAAG 

TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 

TTTGGATATTCAGCAGNGGCTGNGAMCTGGATTTGAATTACCGGGATACATGCATGCTT 

CTTGGTT 

Sequence 31 9 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAATACGTAGAGATAATMCAGTTTTT 

TAAAAMCTTAATATTTGTTATTGAATGTATATTTTTGAGTATTGCATCTTTTCTAT 

MTAAGGAGGTGTAATTTGAACGCTTTTAGAAAGAAAAAAGAAGAATTAGTGCAAGAAAT 

TMAGATTTGATTAATTCTTCTTCTTCATTAGTTATAGCTGAATATCGTGGATTAACAGT 

TGCTGAAATTGAAACTCTTAGAMTGAAGCTTAMGAAGCAGGTGTTTTTGTMAAGTTT 

ATAAAMTAGACTATT TAAAATAG CATCTAMGAAGCAGGTTTCGGAGATTTAGAACAAT 

CACTAGTTGGTCCAAATTTTTTTGCTTTTGGTTCTACAGATGCANTAGCTCCAGCTAM 

TTATTTCAMATTCGCTAAAMCAMTCCAGTMGTTGTATTAAMGGCGGTATTTTTTG 

A 

Sequence 320 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACGGAATTTTAAAAATAGACTNTAAAAAACCN 

TCAATTGCANMCTMTTTTGNTTGAmACTTCAMTTGAAGGAACTCTTTTAGGT^ 

GTTGNAATAGTAAANGAAATTCGATNAGTAATTCAATCAAAGAATAATGAATNGACAAAT 

TCTGACATAAAAACAGCCAAAGAATATCATTTAGAACAACTAANAGCGCTTAAAAAAACT 

TCAMTTAACAMTATAATTTATMTTTATGGCACTATGMAAAAAAGNGACTTTATTTA 

ATATCAACTTTANCATTTTCMCAATTTTATTAGCTATTNCTNGTGGTAAAAATCAAACT 

ACTCCTGTAGGCACACAACCANTTGATANTCCAAGCCTGGAGGATTCATCAAGNGAATCA 

ANAGTTCTGGATTTAAATTAATAGAAAAAAAG 

Sequence 321 

CCCTTAGCGTGGTCGCGGCCGAGGTACAATTTCTAATTGATCCTGTTCACATTCAGTGAA 

ATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGATTTAGGTAACTATTGTGTCTT 

CCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATGCACTCTTATATTCTGCCTTTC 
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TGCCTGGAGTTTGAMTACATGTCTCTTTAGT^ 

AGACCGGAGATAATACAGTGACTTTACCTCACAAATCATATTCTGTCAACACAAATCTAT 

GMTTTAGTTTATTTAMATCAGAACAATTTCCTACAAAATTTTTCTGGAAAATAGACTC 

CTAACAGACCTACCAGAATCATGCTTAAAGGGCTCCCTTGACACTTATTCTATACTGAAG 

GATAAATTTTAAAAAAT 

Sequence 322 

CCCTTTCGAGCGGCCGC CCGGG CAGGTAC 1 I I 1 I I I 1 1 1 I I I I 1 I I I I 1 I 1 1 I IAGTTAA 

ATATAMTTTATTTATTAACTTTTTCAATACCTTTTGMGGTCAAA 

TTAGGTTCTTTTGAAAGTAAATTCCCACAACTGAAAGTGCAGCTAGAATAATAAATACTG 

CATATGCAGCTTCTGTTGATGCTGATTTAATTCCTGCAACTAGAACAGTTGCTATAGAAT 

AMATGCATMCCTMTCCTCANATAAGTCCMATTGATATCCAACTTTTTTAGGATTTG 

ATCCTTTGTATTCGGGAGGMGATTTAAGATMCACCTTGMTTCCTCAAAGCATTACTC 

CCATTAAAAATCCTAAAATATAGAATAATCAAGTTCCATTACCTA 

Sequence 323 

G CGGTG GCGGCCGCCCGGGCAGGTACAAGGTAAAGCAAGAGCTGGCTCTCTACGTTCACC 

MTTTTTGTAGGTGGTGGTCGTGCATTTGGGCCTACAAATMTMAAATTACAAAATTAA 

ATTMACAAAMAGTTCGCMATTAGCTTTTGCCTCAGCTTTTAAGTCAACTTGCTCAAA 

ATAATCAAGTACCT 

Sequence 324 

NNCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAA 

TAGAAT ACAT CAGAGATGTTAAAGATTTCCCAATTGAAGGGATTGTATTTAAAGATATTT 

CACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTANNCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA 

CTGN AG CTTTTTTAAAAAAACCTTTTATTATG GTAAGAAAACCTAAAAAATTACCAG GAG 

ACGTTATTAGTTTTGAGTATGATTTAG 

Sequence 325 

CCCTTTCGA GCGGCCGCCCGGGCAGGTACTTGTCCATMTTTGTGAATATATTAACTAAT 
TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGMAAGCTTTGAGCTTGATT 
CTCATATCCAMCGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 
GGTGMGATACTCCTAAATCMGTMTACACCATCTACTTTC 

AGTTCTTGATCAAAATCTTTMAATCAGATCAAATAAATTCAATATTTGAAGAAATTTTT 

MGAGTTTTTCTTTTGTTTGTTCMTTGCTTGTTTG 

Sequence 326 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACCTATCTATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 327 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAMTATTTAAAAAA 

ATGGAGGAATTATGAACTTAAAAAGCAAATTAAGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTTTGCTTCTTGCCAAACAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTMTAGTAGAGMGGMGGTTTTTAGCATTAGATCAAA 

TTGTTMGCTTTGAAATGAMGTGMAAAGTTAAAMCAAAGMGMGGATTTTATCCAA 

TMCACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 328 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 
CAGGATTCCTCATTGGTGAGTTCMCCATACATTTTGGGTAACTCCTGAAGACATCTGCA 
AATTGTGAGTTAGTTGGTGGGGTCCATTAACATTTGCATATGTTATTCTTTCTACTGAAG 
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TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 
TTTGGATATTCAGCAGNGGCTGNGMACTGGATTTGAATTACCGGGATACATGCATGCTT 
CTTGGTT 
Sequence 329 

ATAGGGCGA ATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTMTTAATGAGC 
TAACGTCATATTTTTTAAGTTTTTCM 

TTACTTTATGAACAACAGCACCTTGAGATTCGATTMGTTAATMTCGCTTTCATTGTTC 

CGCCAGTAGCTMAACATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATATTAG 

TTTGGATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTC 

CTGGTAATTTTTTAGGTTTTCTTACCA 

TTAGGGTGGTCCCAAACAA 

Sequence 330 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACCTCTGATTCTCACTAGTTGMTGCAAGAAC 
TTGAAAGGTTCAGGTMGTGTTTTGAAMATTTTGACTTT^ 

ATCTGAAACTCAGGAATCAAAMATACCGACAGGCACTGTTACTTTCAAAATTCTTTCTA 

TMGTTGAGAATGGGACAGATTTGCAGAGCAAGGGAAACTTGAACAGTTACTTCTAGTGG 

TAGGAAATGAGGTGGCTAGGATATTACCCAGCTGGTGGGTGACTTGGGCAGTGTGTTCCT 

GCTTTCAGTGGTTAGCCTTTAGCAAATCTGCTTTAGAGTGAGAGTAGAGGGCAGGCTGTT 

GTATTACAGTGCTCTTGTTTTTGTAMATTTAATTCACTCTACTGNTATTTTGTCTCCTT 

G G GT AAAG N G NT ATTT AATTTTTCT 

Sequence 331 

ATTATTGCNGAAAGAAGATTC ACTCT CACCTGNTGAATAACGTGTTCATAGGTAAAGGCT 

ACAAAATACTAATTTGTTATTATTTTTMTAATAATTTTTGTTTTGCTGANM 

T TACCA CTTTTTTATTTTTTAATCCAAGGAGGAAAAATTATTTCCAACCAAATCCT 

ATTTTTCACGTTCTAAACCAGTTCAAGMCATTGAGTAAACAGAAATATTCCATTTGTCA 

AAGTTTTTCTTATCGGCTCAGATAATGAAAAAATNGGGGATA 

Sequence 332 

CCGCGGTGGCGGCCGCCCGGCCAAGGTACGCGGGGGCAGAAGAGGAAGATTTCTGAAGAG 

TGC AGCTG CCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGGTA 

AATATTTTCCCTTCGTATTCGGTAGTGCTGTTGAGTCATCTTGTCCAATGCAAATCCTGA 

GMGCTATGTTCCCAAAGAGGGCCAGCTCCATTTAGTGTTTGTTTATAGCCTTACTATGC 

CTCTACC TCTGG GGGTTGTAAATCTGTTNTACCCAATGGGNGGGTTTGTTNCCCTTCCTG 

MNCAMTTTTTCTGCTTNNACACTTGGGCAMCCNTTTCCTAAATTTCATCCTCCCANA 

ACTTTGCNCCNCCNTTGGGGGGAGGTTTGGGGTTTTCAACTCCNGAANAAAAAGAGGGGC 

CCCCCA CCNN AGGGNNTNNTTTNTTTTAAATTNGGGCCNNGGGGGNTTNNANTTAAAAAA 

NNGGGNTTTTNGGGGGACNNTTTTTNTTTTNACCCCCCCCCCCCTTTT^ 

Sequence 333 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGTTGACCTTCCTTG 

CCCAAGAATGAATGATTAAAAAGTTGAAAAGCTGTGTTTGTAGATCGTTCAGGCCTTCGT 

CTGCAACTATCAGGAAATATGGGTMCTCTCTTCCAAGAAACTTCCCCACTCTTTTGATA 

AGCATCTCGAAAATGTTGTTCGAACATCAATGGTGGTATTTTCTGAAGATGAAGAATTAA 

AGCAGTTCTCAAAGAAAGAAGTTCAGGGAAGTTGAAATACGCATACTCGGCATCCTTGAA 

GAAAACTATGGTGAATTGTCCTCCTTTGCTAATAAGATCCACAAGATAGCCGTTCAACCA 

GATAGAAAAAATGGGAGGTNCCTGCCCCAGGCTTTAAANTGANTNNCNCCNCAGCATCCC 

CTTGGGGATNNGGTAANNNCCNCCCCCCCNANCAAAAAAAAAAATTTTTCCCCNCCNACC 

NMTTNTGMAAACCNNGNGGGGGTTTAAAMTTTTGNGNTCNNAATTTNAAAAAAAAAA 

AN 

Sequence 334 

CCGGGCAGGTACTAGGAGATATTGATTCTAGTCAATTAGGCATTGTAGACTGTCATGACC 

ACTTAATAAAAAATTATGGACCTGAAGCTCACGAGCATCCAGATTTTATTATGATGTCAA 

AAGATGCTGCAATTAAAGAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTA 
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CMTGGACCCTCCTMCGTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAAT 

TAGAAGGAAMGCTAACATTATTATGGCAACTGGTTTTCATAAAGCTGCATTTTATGACA 

AAGGTGCTTCTTGACTTGCTTTGGCTCCAACAGATAAAATTGNAAAAATGGTTGTAGCTG 

AAATCGAAGAAGGAATGGATGAATATAACTACAGCGGACCAGTTGTAAAAAGATCTAAAT 

CCAAAGCCGGAATTATTAAAGC 

Sequence 335 

CTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGAMTGCAAAAAAATCAANNCAGTNNANT 

CNAATACATCACAGATGTTNAAGATTTCCCAATTGAAGGGATTGTATTTAAAGATATTTC 

ACCACTTTTAGCAAATGGGAGANGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAG 

CTAAAGA TGCAGA TGTTATTATAGGTCCAGACGCMGAGGTTTCTTGTTTGGGACACCTA 

CTGCAGCTTTTTTAAAAAAACCITrrATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTNTTAGTTTTGAGTATGATTTAGAATATGGTAMTCAACTCTAGAAATCCAAACTA 

ATATGTTGAAAAAAGGC 

Sequence 336 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATGCAA 

AAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGGG 

ATTGTATTTAMGATATTTCACCACTTTTAGCAMTGGAGAAGTGCTAAATTACACAATT 

AATCAAATGGCTGAGTTAGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGT 

TTCTTGTTTGGGACACCTACTGCAGCTTTTTTMAAAMCCTTTTATTATGGTMGA 

CCTMAMAATTACCAGGAGACCGTTATTTAGGTTTTNGGGTNNAGATTTAAAATTGNGG 

AAANCCCCNNTTNTAGAAATCCAAACTAATTNTGTTGNNAAAAGGGCCAAAAAGTNCCAA 

TTATTGGTGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTATTAACTT 

Sequence 337 

CCGCGGTGGCGGCCGAGGTACCAATAATAGCAACCCTGTGATTTGTCCAAGTGCCCGGGA 

GTGGAGGCCATCCTGACAACAGCTCTATGATTTTCTATGCCAATGACACAGGAGCCCAAC 

AGTTTGAAAAGTGGTGGGATAAGTCCAGGACAGTCCCCTTTTATCTTGTAGGGCTCCTCC 

TCCCACTGCTCMTTTC AAGTC TCCTTCATTTTTTTCAAMTTTAATATCCTAGGCACAG 

TGTCTGTCCTTTATTTGATTTTCCTTGTCACCTTTMGGCTGTTCGCTTGGGATTTCATT 

TGGAATTTCATTGGTTTATACCAACAGAAT I I I I IGTACCTGCCCG 

Sequence 338 

CCGGGCAGGTACCTGGAAGACTTCTCCACCTCGGGGGCCTGGCTGCCTCACAGGTATGAA 
GACAACCACCATAACTGCTACTCTTACGCACTCACGTTCATTAACTGCGTTCTGATGGCA 
GAAGGTAGACAGCAACTGGACAAGGGTGAATTTACGGAGAAGTACCT 
Sequence 339 

ATAAACTGCGGGATCTCAATGGCTTCTATGATCGTATTGAGGCAGTAGTTCCCACACTCT 

GCCCGGTGCCAGCATGAAAGAGAACAGGGAGAGTCAGCCTTTCACAGTCCTTGTCAMTC 

CCAATTACTCTGGTTGCAGATCACTTGAAGCCTGCCTTGTTTCTCTCACAAAGCTCTGCC 

CGAGAGTCCAGCCCCGCGTACCTGCCCG 

Sequence 340 

GCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAAGATAGTCATNTCAGTAAAAGGT 

CTATTATCTAACTTGCCAAACTTGTTTANNGAGAGCCCTAAGGAACTAAAACTGCCATM 

TGCCI^GCACAGCTTGAMAGCAATTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAG 

TTCCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAA 

AATAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCAC 

CTNTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAG 

Sequence 341 

TAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTCTAGGACAA 

TCAG GAAG TAATCTTAAAAAAATAATTGAAGATGTTAAAAAATACGTTAAAAATAAAAAA 

CTTGTTTTAAACATAGATGCAGTAGAMTTGAAAMCCAGATTTAGATGCAAMTTATTA 

GCTGAATCMTTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTT 

GCAATTCGTTTAGCACAAAAAGCCGGAGCTAAAGGTATTAAAACTAAAGTTAGCGGTCGT 
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TTAAATGGTGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACAC 
ACACTTAGACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTT 
GGAGTTAMGTTTGAGTTTCATTAGGCGAAAGTATTTGCAAAGCAAAATCAAGCATATAA 
TGAAGAAGAACCAACNCACAAAAAAGGGCCAAAAAGAGCAGCMGAGTTAAAAAAGAA 
Sequence 342 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACC 

TTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGA 

Sequence 343 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCATGAATGCCGTGCTCCAGGTGTTCAC 

AGCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGC 

CTGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCT 

GGAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTNTTGCCAGATGACAAGGN 

GACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCAC 

Sequence 344 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGA 

GCTAACGTCATA I I I I I I AAG I 1 1 I I CAATTCCGTTTAAAAATCCTAATTCAAGTAAAAA 

GATTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGT 

TCCGCCAGTAGCTAAAACATCATCAATMTTGCTACTTTTTGGCCTTTTTTCAACATAAT 

TAAGTTTGGATTTCTAGAGNTTGATTTACCATATTTNTAAATCATTCCTCAAAACTAATA 

ACCGCCNTCCTGGGAAATTTTTTTAGGGTTTTNTTACCCCTAMTAAAAAAGGGNTTNTT 

TT 

Sequence 345 

CCGCGGTGGCGGCCGCCCGGGCACGGTACCACTTGAATTATCTATTGAAAGAACTACTAC 

ATCGAGI 1 1 I IGTTCTTTTGCCATTTCAATAGCTTCTCTTGTTTCAATTATCCCAATTTT 

TTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATATTTCTGTTTACTCAATG 

TTCTTGMCTGGTTTAGMCGTGAAAMTTTTTAGGATTTGGTTTGGAAATAAI I I I ICC 

TCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAGCAAAACAAAAATTATT 

ATTMAMTAATAACAAATTAGTATTTTGTAGCCTTTACCTATGAACACTTATTCATCAG 

GTGAGAGTGAATCTTCTTTCTGCAATAATTTCTAATTGCCCCGCGTCCTTGGCCGCTCTA 

GAACTAGGTGGG 

Sequence 346 

CACTACTATAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGA 

GATATTGATCCTAGTCAATTAGGCATTGTAGACTGTCATGACCACTTAATAAAAAATTAT 

GGACCTGAAGCTCACGAGCATCCAGATTTTATTATGATGTCAAAAGATGCTGCAATTAAA 

GAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTACAATGGACCCTCCTAAC 

GTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAATTAGAAGGAAAAGCTAAC 

ATTATTATGGCMCTGGTTTTCATAMGCTGCATTTTATGACAAAGGTGCTTCTTGACTT 

GCTTTGGCTCCAACAGATAAAATTGTAAAAATGGTTGTAGCTGAAATCGAAGAAGGAATG 

Sequence 347 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTAGAAGAAGAAGG 

AATACCTAAAGAAACAGACATAGAAATCATCCCAGAAATCCCGGAAACTCTAGAGCCACT 

GTCCCTTCCAGATGTGCTGAGGATCTCGGCAGTTCTGGAGGACACCACAGGCCAGCTCTC 

TATTCTGAACTACATCATGCCCGTTCAGTACCT 

Sequence 348 

TNCTTAGGGCGAAtTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTMTTAATG 
AGCTAACGTCAT A I I I I I I A AGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAA 
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AGATTACTTTATGAACMCAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTG 

TTCCGCCAGTAGCTAAMCATCATCMTMWGCTACTTTTTGGCCTTTTTTCAACATAT 

TAGTTTGGAT TTCTAG AGT TGAT TTACCATATTCTAAATCATACTCAAAACTAATAACGT 

CTCCTGGTAATTTTTTAGGTTTTCTTACCATAATAAMGGTTTTTTTAAAAAAGCTGCAG 

TAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAG 

CTAACTCAGCCCATTTGATTMTTGTGTMTTTAGCACTTCTCCATTTGCTAAAAGGTGG 

Sequence 349 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAACAGGT 

GCTCCTGTTAAAATAGMGATCTTGCAGCTACTTCTAGAGATTTAAATTCTAAACGATCA 

ATAGCAGCGTATCCTGTTCCGGCTTTAATMTTCCGGCTTTGGATTTAGATCTTTTTACA 

ACT GGTCC GCTGTAGTTATATTCATCCATTCCTTCTTCGATTTCAGCTACAACCATTTTT 

ACAATTTTATCTGTTGGAGCCAAAGCAAGTCAAGAAGCACCTTTGTCATAAAATGCAGCT 

TTATGAAAACCAGTTGCCATAATMTGTTAGCTTTTCCTTCTAATTTCTTTGCAATATCT 

MC ATTT GATAAACATCACGCGCAACGTTAGGGAGGGTCCATTGTAACAACAGTTTTTCC 

TCCTTTTGCTACATATTCATTCATTTCTTTMTTGCAGCATCTTTTGACATCATAATAAA 
ATCTGG 

Sequence 350 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCeCGGGCAGGTACATTCTTCACTATCA 

CTGTCCTGTAAATTTAGTAGCCTTGGCTGGAAACACTGTAGTCGACATGATCTGATATTG 

CTTAATATTTCAGAAAGAGACAGTCTATTTTCACAATGTTTACTGGAAGCATTGGTCCGA 

GAGAAATTAGAAGAAAAGTCTATAGTTTGGGAAGAGCTTGAAAAACTATTCAGCATTTCA 

GGGTCTATCTGTTTCAGGACTGGGTCATGTTCTGTGGATATTCGGTCCATTATGACCCTT 

CCACCTCTGCCAATTCGCCTCCTTGCAAATCCTATACATCTTCTTGGGACTGTAAGTGTT 

GTAAGGC 

Sequence 351 

CTNCTTAGGGCGAA TTGG AGCTCCCCGGGGTGGCGGCCGAGGTACAGCTTGGTGACTGTA 

ATTAACAACMTGTATTTTGAMGTCACTGAGTAAATTTTAAGTGGTTTTTCCAAAAAAG 

CACATAAGGTAATGCACCGTTAATTAGCTATATTGAGCCATTCCACAATGTATAGATATT 

TC AAAACATGTTAC ACAT GATMA TCCAGTTTTTCTACGTCATTTTTTAAAATTATATTT 

AATi 1 1 I i I ATTTTGAAAGTTTTTTCACAGATC I I I I I I I I AGTATTATTACCTTCTGAT 

ATATGTGTCATTATTGMGAACCATACCTTTTAAAGGTATTATTTTGTAATTAAGGTATG 

TC AACAGT AAAAAATAACCAGTGGCCCAGGCCATNGGGGCTCATGCCTGTAAATNCCAGC 

ACTTTTTTCGGAGGCCCG 

Sequence 352 

CCGCGGTGGCGGCCGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCT 

TGATCCCCAGAACTGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCT 

TCAAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGAGAACAAAGATTTGGGGCACTTTC 

MAATTAATGAAAGGTTTACAAATGACCTTTTGAATTCATCTTCTGCTATATACTCCAAA 

TATGCAAATGGAATTGAAATTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTTTGAG 

GTCGGTTCAGGTCACCCAATTTCGAAATGGAAGNCATCGTTGCTGGGTATTGAAAGGTTT 
GT 

Sequence 353 

CCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATTCT 

TCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGACAA 

AAACCAAAAAMCAAAAACTTATTTTAGAAAGACTTGGATAATGATTGATTATAAAAAAA 

TGATTGACCATACTTTATTMMCCAGAAGCTACATCTAMGATATTTTAAAACTTATTT 

CACAAGCTAAAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATTAG 

CAAAAGAAAMCTTGCAMTACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATTAG 

GTG CATCAAAC ACACAAACCAAGGTmTG AAGCAAAATTAGCAGTTG AACATGG AGCTA 

CTGAAATAGATATGGTTATAAATGTGGGTAAATTCAAA 
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Sequence 354 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGCATGAAACCACACA 

TACGGGCAAAAGCAGGAAGAGACTTTGCAGACATAAAAACCAGGTAAGTTTTGAGAGTCA 

GGTGATTGCAAG A I I I I I I 1 1 I CTTTCATTGACACTTAGCTAATTTTGAGCCATATTTTT 

MATGTGTGTCTTTTCAGTTACTTGAAACCTCTTCTCCCACCTCCCATCCTTTCCAAACC 

TCCTCCAAGCCAATTTTCCAGTTGGTTTAAAAGAAATCATTAGGGTGACCCCCACTCCCA 

CTTTCTGGAACGTCAGGCAGAGCTCCTGGGGTTCCTCC 

Sequence 355 

NNI I I I I I I I I I I I I I I I I I I I IGCGTGGAATGCTTCACCTCTTGCTAAAGGAACAGTAA 

TGTCATATCCTGTTAATCTTGCAGGAGGCGCTAATAAGAATTCAAATGCTTTCTCATTTC 

TCTTGTAATAATTTCAGTGAACTGAAAATGATTTAACAGCTTCGTGAACAACTAAAAGTC 

TTCCAGI I I I I 1 IAACAGAGTTAATAATTGTATCTGTATCTAAAGGAGAAATTGTTCTTA 

AATCMTTAATTCTACAGAGTATTCTCCATTTAATTGTTTTAAAGCAGCTAGTGCTTCGT 

GMCTTGTGCTCCATATGTTACTAATGTTAAATCAGAACCTTCTACTAATACATTTGCTT 

TACCMTTTCAACTTCATAAATTCCTGCTGGAGCTTCTTGTTTGAATGAACGATAAATTT 

TCTTAGGTTCTAAGAAAATAACTGGATCTGGGTCGTTAATAGCTGCGATTAATAATCCTT 

TTGTATCATAAGGAGTTGAAGGCATAACAACTT 

Sequence 356 

AGGGCGAATTGGAGCTCCCCGCGGTGGGCGGCCGAGGTCTAAACTTTAATTAATGAGCTA 

ACGTCATA I I I I I IAAGI I I I ICAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATT 

ACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGTTCCG 

CAGAGCTAAAACATCATCAATAATTGCTAC I I I I IGGCCI I I I I I CAACATATTAGTTTG 

GATTTCTAGAGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTCCTGG 

TAAI I I I I I AGGTTTTCTTACCATAATAAAAGG I I I I 1 1 I AAAAAAGCTGCAGTAGGTGT 

CCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAGCTAACTC 

AGCCATTTGATTAATTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGTGGTGAAATATC 

TTTAMTACAATCCCTTCAATTGGGAAATCTTTAACATCTCTGATGTATTCTATTAAATT 

GATTTG A I I I I I I I GCATTTCCCCGCGTACCTGCCCGGGCGGC 

Sequence 357 

GAATGGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCT 

ATTATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATG 

CCGTGCACAGCTTGAAMGCAATTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTT 

CCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAG 

Sequence 358 

AGGTACANGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAATTMGA 

GTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCAG . 

GAGGGGAAGGGAAATATAACGGAACCCAACAATTAANAAATAGGCAAATAGCCAATTAAA 

GTAAGGAATGGNCATCCCATGGGAGGCANCAACCATTAATTTCTTGGGAACCCACTNTNT 

CCCNGGGATTCAGGGCTTCCATTTGCTTCACNGATGGCTTCACGTCTGGNGCAGCCCGCC 

AACTCTTACTTTGCCAGGAAACCCTCACCTCAACTTTGCCCAGGGTATTTCTNCCCCGGG 

TCTTGGAAANGAAAATGGGCTTCNTCCACCTGAAAAAGGGTTNGAATCCTTTCTTCCCAT 

TACCCAGGCTTTCCNTTTAAAGCCAAAMGGCAAATTCCCTCCTTTTTTGGCTTTTCCT 

Sequence 359 

CTAATTGATCCTGNTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTNACAGC 

TTATTGATTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTT 

CTCTATGCACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGT 

TTCTTTTGCACATGCTACATTGGGCTTTAGACCGGAGATAATACAGTGACTTTACCTCAC 

MATCATATTCTGTCAACACAAATCTATGAATTTAGTTTATTTAAAATCAGAACAATTTC 

CTACAAAATTTTTCTGGAAAATAGACTCCTAACAGACCTACCAGAATCATGCTTAAAGTG 

CTCCCTTGACACTTATTCTATACTGAAGGATAAATTTTAAA 
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Sequence 360 

CCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGTCGTGCACAGCTT 

GMAAGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACC 

Sequence 361 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I I GTTTTATATT 

CTTCAMTCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGAC 

MMACCAAAMMCAAAAACTTATTTTAGAMGACTTGGATAATGATTGATTATAAAAA 

MTGATTGACCATACTTTATTAAMCCAGAAGCTACATCTAMGATATTTTAAMCTTAT 

TTCACMGCTAAAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATT 

AGCAAAAGAAAMCTTGCAAATACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATT 

AGGTGCATCMACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACNTGGAGC 

TACTGAAATAGATNATGGGTATAAATGGTGGGTAAATTTCAAA 

Sequence 362 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCT 

TGGTAGGCCATTACCATACCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCT 

GTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTG 

TTTCCAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCT 

Sequence 363 

CACTACTATAGGGNTNATTGGAGCTCNCCGCGGTGGCGGCCGTGCTGTTGCTTGGCCGCG 

CGCCAGGCTGGCCAGAGGTGCTGTTCCACTGGGGCGTGGCCGTGATGGTGTGCGCCGATC 

GACGTTCTGTTTTAATGAAAGGATGGACATGCAACTGACACTCGCACAACTCTTGGCAAT 

CATGCCCAATGCCCGCTCCAMGCGGGTATTTTTTTGCCTGCGCTAAACGTGGCCATGGC 

GGAATTCGGTATCAACACGTCGGCGCGCCAGGCCGCGTGGCTGGCCACCATCGGTGTCGA 

GTCCGGTAGCCTGCAGCGGGTAGAGGAAAACTTGAACTACCGCGCGGATCGCCTNCTCGT 

TATTTTCGGMAATACTTCACGCCGGCGTTGGCCGCAGCTTATTGCCGGCAAGCCGGAAA 

TGATCGNCAACCGTGTTTACGCCAACCCGCATGGGGGAAACG 

Sequence 364 

AGGTACTAAACTTTAATTAATGAGCTAACGTCATA I WW l AAG l I 1 I I CAATTCCGTTT 

MAAATCCTMTTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCGATT 

MGTT MTAATC GTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTT 

TTGGCCTTTTTTCMCATATTAGTTTGGATTTCTAGAGTTGATTTACCA^^ 

ATACTCA AAACTMTAACGTCTCCTGGTMTTTTTTAGGTTTTCTTACCATAATAAAAGG 

TTTTTTTAAAAAAGCTGCAGTAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTAT 

AATMCATCTGCATCTTTAGCTMCTCAGCCATTTGGATTAATTGTGTAAATTTAAGCAC 

TTCTCCATTTGCTAMAGTGGTGAMTATCTTTAAATACAATCCCCTTCAATTGGGGAAA 

TCTTTAAACATCTCNGGATGGTATTCTATTAAAATTGAATTGAA 1 I I 1 1 I IGGC 

Sequence 365 

CCGCGGTGGCGGCCCGAGGTACCAMATAAAGGGTATTTGCTACCTTTAATACTTGCCAG 

TTCAGGTTGGAGGCACAGGCAGCAGCAAGAATGGAMGAAATGTTCTTACMCATTTTCA 

CAG GAAAT GTCCCAGTTMTTTTGAATGAMTGCAAAAGCTGMTATTeCAGTTTATTCA 

ATGATTTTGTTGMTCTGMTTTTTTTTGATTGATGGGGATTCATTACT^ 

TCTGTGAGATATCATTTMGCCTGGGCAGAACCTCCATTTCTTCTATCTGGTTGAACGCT 

ATCTTGTGGATCTTATTAGCAMGGAGGACAATTCACCATAGTTTTCTTCAAGGATGCCG 

AGTATGCNTATTCMCTTCCCTGGACTTCTTTCTTTGAGMCTGCTTTAATTCTTCATCT 

NCAGAAAGAATACCCCATTTGATGTTCGAACAACATTTTCGAGATGCTTATCAAAAAGAG 

TGGGGAAAGTTTCTTTGGAAGANGAGTTACCCCATATTTNCTGATTGTTGGCAGACGAAA 

NGCCTTGMCGATCTACMAACCNCAGCTTTTTAAACTTTTTAAATCNTTCMTTCCTTG 

GGGCAAAGGGAANGMTNAACNTTTGGTACCTTGCCCGGG 



• 
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Sequence 366 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

MTAGMTACATCAGAGATGTTAAAGATTTCCCMTTGAAGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTCACAATAATCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACCCMGAGGTTTCTTGTTTGGGACACCTA 

CTGCAGCTTTTTTAAAAMACCTTTTATTATGGTAAGAAAACCTAA 

Sequence 367 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TMCTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 368 

ACCGCGGTGGCGGCCGAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACG 

GGAGGCAGCAGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGNGTG 

CAGGATGAAGTTNTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAANAAAAAAAAAAA 

AAAAAAAAGGTNCCTGCCCG 

Sequence 369 

GGCGGCCGAGGNACAATATAGNCATCGCNTTAAACNGCCNANTNTTAANCNCGCCAAACT 

TGGTTACTGAGAGCCCTAAGGAACTAAAACCGCCATAATGCCGGGCACAGCTTGAAAAGC 

AATTAGAGGAAGCAAGANNAGNNNTTCCTCCCTTCCAGNNCCTCAGCAGGCCTGGCTGAA 

GGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGA 

ATGCCATCCCANGGAGCACACCAAAATTCNGGAACCACCNCTCCCGGANCAGGCNTCCAT 

TGNTCACGAAGCTCACGCNGGGCAGNCCGCAACTTTACTTTGNAGNAACCTCCCCACTTG 

GC CAAGG GAATTCNCCCCCGGGCCTGGAAGAAAAGGGNTCTCCCACCCGGAAAGGGGCGN 

ACCTTTTCCCAAAACCAGCCTTTCCTTAAAGCNAAAAGCAAACCCNCTCTTTTGGGTTTC 

NCAAAGGGGGCNGNACAAAAGGGAAGGGTTTTGGGGCNGGGGGGGGGAAACAAAANCCCC 

NCATTNGGAAGNTTGCCCCCGGCCGAGGGGAAGGGGAAAAGGTTGGNCCCCGGTTGGGGG 

GGG 

Sequence 370 

ATTGTTGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACANGGAACTGCCAAAGGCAACA 

GAAATTCTTTCTCCCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTC 

AGAAACCAGATTCAGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGG 

CCAGGGAAGGAGANNAAGGTAGNCAAGCGGGGGTGGTCTCAAATCTGGTTGNGCTCGAGC 

TATGCAAATGCCTCTCATGGAGATGCGAGGACCTATCTATTATGATGACCAGGGCCACAT 

CCGGAGGGGGCAACAGACTTTCATNTATCAAGCCCTT 

Sequence 371 

AGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGC 
TTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCA 
GAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATT 
TTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 
Sequence 372 

CCGGGCAGGTACCGGGTCTAGGGAAGATGCAGAACACTTCAGCCTGGCAGAAGGCTCTAA 

AGATCGTCAACCTTTCCTCTGTTCATTTTGMATTTTTAAAAATATGCTAACTTAGCTGA 

AAATCTCATGGAAACCAGGCTCCTCATCAGACTTGAAAGTCAAACCGGTTTCTCAACAAC 

TTCTCTTTATGGTTCTGTATGGCTCCACAGAAAACCAGAAAAACATTTGGGGCAAGAAGC 

TATGACTCTGTGAGGCCACATGGGAGCAGGCAGTCAATTATTACACTAAGGAACACCCAG 

TTAGCATGAAGTATCCCATCACCTCGGTATTAAGCCCTGCATGCATTAGCTATTACCT 

Sequence 373 

AC I I I II I I I | ] I I I I IGCTCAATAGAAGTATGGAATAATTCCAGGTAATTTAAAGCATA 

TTTTTCAATTGGTGTAAGCTGCTCCATGAAGTCAGCTAGCTCCTCTAATTGGGATGGTTC 

TTCATCACACGGCATGTTCTCAGAGTCTGATGACAGAGCATCAGTGTGTGGTCCCAGCAC 
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CCCCTCCTGTGCGGACTTCTGGGCATCCTCCTCCAGATACTCAATACTCTTGAGGGCCTG 

AGGMAGTCTCTATGAAAGGTCTTGCMTTTTGGGTGCAATGGTTTCCGTGACAGAAGGT 

TCCTGAGAAAGCACCACAAACTCCTCAGCTTTGACCGGGAAGCCAGCATCACGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 374 

CCGGGCAGGTACCGCTACTGAAATTATTAAACATACACTACAGATCAATTATATAANTAT 

GTTAATATCTTTAGAAATCAAGAGTTGCAGCATAAGAGAAAGGGATACAAAAACAAAACA 

AGC AAAGA AGTTACATAAAAACGTAACGTTGTATTGGAAAAACCAGTATGAACTTATGAT 

TTAGTTTTCTTTCTAAAAACGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 375 

NGCGNTGCGCTCACCTGCCCNCTTTCCANTCGAGGNAAACCTGGTCGTGCNAGGGTGCNA 

NTMTGMTTCGNCCAMCNCCNCCGGNNGAGNAGGCGGTTTTGCGTNATTGGGGCCGCT 

ATTCCNCTTTTCTCGCTCACCTGACTTCGCTGCCGCTCGGTCGNTCGGCTTGCCGGCCGA 

AGCCGGGTAATTCAAGCCTCCACTCAAAAAGGGCCGGGTAAATTACCGGGTTTNTTCCAC 

CMNGAAATTTCAAGGGGGGGATTAAACCNCNAAGGGAAAAANGAAACATTGTNNGANNC 

AAANAAAGGGCCCNNNCAAAAANNGGGCCCANGNNGAACNCNCCGTAAAAAAAAAG 

Sequence 376 

CCGGGCAGGTACACTCCAGCTCCTCTATCCTTTGTTCCAGGTGAGCCTTNTCATTAAATG 

CTGTCTCTTGAGCAATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAMGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTGGGTCGAAGCTTGTACACCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAAAATNGCNCNC 

TGGGGCNAAATCNNGGGAAAATANGCTGTTTTCNTGGGAAAAAAATTGTTTTNCCCCGCA 

ANAAAAATCCCCCACAAAAATTNTANNGNNNGGGNGAAAAAAAAAAGNAAAAAANNGNGG 

GGGGGGGCCCCCAANNGGGGGGNNGCCCNCNCCCCTNATANTTNATGGNGGGGGGGGGCC 

CTCNANNNGGCCCCTTTTTNNNNMNNGGGAAAMCCTTNGGCGGGNCCCNTTTNTATNA 

ANAAAAMCCCNCCCCCCCCCCCNNGGGGGGGGNGGGGNGGTTTNGNTTATTGGGGGGCC 

CNTTTTTCCNNTTTTCCNAANAAAAANAAAANANNCGGNGNCCNGGAGGGGGAAANGNGG 

GGGGGGGG 

Sequence 377 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGTGATG 

CTGGCTTCCCGGTCMAGCTGAGGAGTTTGTGGTGCTTTCTCAGGAACCTTCTGTCACGG 

MACCATTGCACCCAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAG 

TATTGAGTATCTGGAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACA 

CACTGATGCTCTGTCATCAGACTCTGAGAApATGCCGTGTGATGAAGAACCATCCCAATT 

AGAGGAGCTAGCTGACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTA 

CCTGGAMTATTNCATACTTTNTATTTGNGGCNAAAAAAAA 

Sequence 378 

GCCGGCCCGAGGACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAA 

AAAAAAAAAGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGGGGCCCCGGGTACCCAAGCTTTTG 

TTCCCTTTAGTGGAGGGGGTTMTTTGCCGCCGCNTTGGGCGTAAATCAATGGGTCATTA 

GCTGTTTCCTGNGNTGGAAAAATTGTTATCCCGCTCACCAAATTCCACACCAAACCATTA 

CCGGAAGCCCCGGGNGNAGCCATTAAAAAAGGTGTNANANAGCCTTGGGGGGGGTGCCCT 

ANATN GGAA GGTGGAGCCTAAACCTCACCATTNAATTTGGCCGTTGGCGCCTCACCTTGC 

CCCGCTTTTCCAAGGTCCGGGGGGAAAAACCTGGTCCGNGGCCAAGCTTGCAATTAAATT 

GGAAATTCGGGCCCA 

Sequence 379 
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GAATTGNAGCTCCCCGCGGTGGCGGCCNNCCGGNCAGGTGGAAAGGTGGGTGGGGAGAGG 

GAGGCTTATTTGTTGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAA 

GGTATATTTGGTTAGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTT 

TGTTTGTAAGTTATAGCTGTAATTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCAT 

TCAGATACTATGCATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTA 

AGCTATTATTCAAGGAGTATCTACCATCATAAAGGTGGTTTAAGTCATATAGGATAATAT 

CMTCAATMTACAGGGAGATGGCAAAMTTTTTTGGGNAAANCCCAATATTANCTTGGG 

TTTNATGACCCCCNMTCTCACACTTTGGGGNCNTATGGGMAGGCTTTTTTTAAGACCC 

GGGAGTTCAAGACCNGCCCTGGGCAACATTAAAAAACCTCCTTTNNNCCAAAANCTTTAA 

MAA 

Sequence 380 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGMTATCCAATTCATGTGAACTACAGGAMTTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATMCACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTMTAAAAGTCMATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 381 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GCATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAA 

TTGATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGT 

ATTAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGA 

GGTGTTTTTTTGTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGA 

GTTAGGTAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTG 

GATTTCAATTCGGCACTTTATGTATTAGCTMAMTTTCATGACCAGATCTTTTGAAGTA 

TACAAAGTAAATCTTCAAGGTGGATAGTTTATCCAAGTGTAAAATGTGTTGCACTAGGTC 

AGCTTGGMTTTTGAGATGACTTTTGGCATCATTGCATACATCTGGTTTGTGTACCTGCC 

CGGGCCGGCCGCTCTAGAACNGTGGATCCCC 

Sequence 382 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCA 

GTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACAAA 

TNGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTC 

CCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGG 

TATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 383 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCTGCATATACACCAT 

TAATTTTACATCGTTGAGGTAGCCAAAACTGCTCGTAAGTGGGCTTTTATTAAATAATAT 

MTGTTCTTAATAGAGGAAAAAGGAATTGAATACATTTTTAAAAACAAAATAACAAAACC 

AATC CATTGTCCCACAAAAGAAAATCAGTGGAGACAAAAGCAGTTTAATTTGCTGGATTC 

TTTTGTGGCTTAI I I I I I GAGTATTATTTACAAAATGTTAGACTAATTTTTAAGCAATAT 

TAATAATAAGCAACATACAACTCCAAGAATAATATAATAAATAATAAACTGCGGACGCGT 

GGTTCGAAGCTTGCTCGNNGGGGGCCGGGNCGCTTCNAGGCCCNCCCGGGCAGGTACCCA 

GTNATCACATAAATTCTGCMTCATNTGG^ATTNAGCTTNACNTGNTTTTTTTATTTGN 

NGAANTTGTTGTTGTATTCAG 

Sequence 384 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACC 

CACG CGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATAT 

TTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCT 

TTACCATATCATTMTAMAGTCAAATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGT 

GGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 385 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCA CI I 1 1 1 I 1 1 1 1 1 I I I I I I I 
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TTAAGAAATAGGGTCTCACTCTGTCCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTT 

CTGACTCCTAGGCTCAAGGGATCCTGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGG 

CAGGAGATCGCTTGAACCGGGAGGCGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTT 

TCCAGCCTGGGTGCCAGAGCAAAACTCTGTCTCAAAAAAAAAAAAAATAATAATAATAAA 

TAAATAAAAAGGCAAGGAATATAGGGAAAAGTCAAAAGAGATGGACTGTGAGAAGACTGG 

GAAAGCCAGAAGAATGGNGGAAAATGTAGCATGGAGTAAGACAATAAAAATATAAGAGGA 

CTCATTTCGGACGCGTGGGTCGACTCACCTCGGCCGCTCTAGAACTAGTG 

Sequence 386 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGCTTCGACC 

CACGCGTCCGGGAMTTTTAATTAAAAATAGGTGAACATTTTAAATGACCTAATACATAT 

TTAGTCCACATTGAMCTTTGGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAA 

ATTTGATGCCATTAAAATTTTGATCAGTAGGTAGCA I Mill I CTTAGCTACAATTGTTT 

TTTTTAATTATAAGTATTAAAAATTCATGAAGATGATTCTTTTTGTAAAACAGTTTTGCA 

TAAAAAGTAAGTCTCATTTTAAAGCAACTACCAACTTACTGGCCACCT 

Sequence 387 

AGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAATGCATCGATATTACA 

GTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATMCACAGGMGTTTTTA 

GAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTGATAATTTACATCCTA 

CAGCCCTGCCI I I I 1 1 I I I I I I I I ICAAGTTTGTCCCAGCAAGTCTTGGCCCTTTGCATT 

TTCTTAATACATTTTAGTACCTGCCCG 

Sequence 388 

CCGCGGTGGCGGCCGAGGTACAAAGAACAAAGGGAAGCTAAGGAAGAAAAGATAGTCAAT 

AAAAGATGTCTCATCTGGGCTTAGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCT 

GAGGCTCGAGGACTGCTTGAGTCCAGGAATTTGGGCAAGTAGGAAATTACTGAACAGCTG 

CTATCACAGACAAATGCCTAACATTGTGAAGTGCTACACAGGGGAAGGAGACCCACGCTA 

AGAGGAGAGCATGCACCCAGACACAGAACTCAGAGGACACAGTTCAAAACACACATACAA 

GAGGCTTAGGCACCTGTGGGCGTGTGTGTGCTCACAGCCAGCAAAATGAAAAAAAATCCC 

AGCTCTGAAGGAGAGGCAAGTGCATGGCTTCCGTACCTGCCCG 

Sequence 389 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGA 

TGGTTTTTGCAAAAATTGAAAATGCATCGATATTACAGTTMTTTTTTCAGTGNGTATGT 

GGTATTAGGCTTAGMCTATMCACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACT 

CCTTTGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGC CI I I I I 1 1 I I I I I I 

TCAAGTTTGTCCCAGCANGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGC 

CCGGGCGGCCGGCCGCCCGGGCAGGTACNACTACCTCTTTAAAGTTGTCCTTATTGGAGA 

TTCTGGTGTTGGAAANAGNAATCTCCTGTCTCGATTTACTANGAAATGAGTTTAATCTGG 

AAAGCAAGAGCACCATTGGAGTAGAGTTTGCAACANNGANGCATCCAGGTTGATGGAAAA 

Sequence 390 

TCCCCGNGGTGGCGGCCGAGGTACTATTGACTAAAGTCAAGTTGGGGGAGAGAGAGGCGG 

AAGTATATTACTTTTNTGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAAC 

ATTNNATCATTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTNTGAGGAGG 

TTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGG 

TAGATATTTGCACTGGT 

Sequence 391 

AGGG CGAATTGGAGCTNNCCGCGGTGGCGGCCGAGGTCTTCNACCCACGCGTCCGATGGT 
TTTTGCAAAAATTGAAMTGCATCGATATTACAGTTMTTTTTTCAGTGTGTATGTGGTA 
TTAGGCTTAGAACTATAACACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTT 
TGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCC I I I I I I I I I II I I I I CA 
AGTTTGTCCCANCAAGTCTTGGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 
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Sequence 392 

CCGCGGTGGCGGCCGCAC I I I II I I I I II I I I I II I ACCTGAAAATGCTTATTCTAGCTT 

CACATTTGATTGTTTGGCTAAGAAGAAMTTATTTATTAGACTTMTTTTCCTCACG 

TTAMGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTAGACGCT7TCAGGAC 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 393 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATTTTMGTGTGACCATGAGGTGTTTTTTTGTGGACCATTTTA 

AAGCCAATGATAG 

Sequence 394 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCTACCCACGCGTCC 

GCACATTTTGATGGTCAGTCAATAACTTAAGCAGTTACCAAAATACTAGGTATCCAAGGA 

GCGAGAGGTGGGCGAGCATAAGAAACACATTTCTCATGGCACAGCTCTGCCAAAGCCCTG 

CAGAATCATTTACACATAGGTCTTTGGTTAGTAGCCCCTGGCACAGAATTCTGATCTTAA 

ACAAATATTGTCTATAATCMGTAGAGCAATGCMTTAAAAAAAAAAAGCACAGGTTTTG 

GGGCCATGCTGAAATCCCAGCCTTGCTATTTGCTGGCTGTGTGACCGTGGTTCCTTGGTC 

TCATTATGCTTTGGTTCCCGTATCTATAAAACGGACGTAATAATGTCTCCCTCTCATTAT 

TGTGAAGTCGAAATGATGTCTGTAAAGTGCCCAACACAGTACTAAAGGGCTATT 

Sequence 395 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTTTAAGTAACAT 

T CAGAT TTGTGTGTGTGGAGAGGTTGTAGGGAACAGAATTGTAGGAAGGTGCTCACACCT 

GTTTTTGTTTGTTTGTTTTATGTATATATC 

GTMGGTATTTGAGCACTCTTGTTTATCTTGTGTAGGTGCCAACCAAATATTTTTATAGA 

GATGTGGTTAAGCCTCTTGGCATGTTCAACTGTGTACCT 

Sequence 396 

CCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATA 
CAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAA 
NGANGGAGAATGAAAACTTCCGGACGCNTGGGTCGAAGCTTGACCT 
Sequence 397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCGCGCGTCCGCATTT 

ATTMGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATC 

TTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAA 

TCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTAT7TTAAGTGTGACCATGAGGTGTT . 

1MII GTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGG 

TAGMTGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTC 

MTTCGGCACTTTATGTATTAGCTAAAAATTTCATGACCAGATCTTTTGAAGTATACAAA 

GTMATCTTCAAGGTGATAGTTTATCCAAGTGTAAATGTGTTGCACTAGGTCAGCTTGGA 

ATTTTGAGAT 

Sequence 398 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCATGTTGGCTGGGCTGGT 
CTCGAACTCCTGACCTCAAGTGATCTGTCCTGGCCTCCCAAAGTGTTGGGATTACAGGCT 
ACTAMGCTTTTTATTTATTTCTGTGGA I I I IC 

A TATG MGGATTCCTTTTGGTATTTTTTGAAGAACTTCTTTATCAGMTTMTTA 
CTTTTGAAAATTTTTAACMTCTAG AAACGTCTTTTTTG I I I I I I GAAGAATAGTTTTCT 
GGATGTAAAATTTTTGGTTAATTTTGTTTTTCTTTGMC^ 
CTTCTGGCTTTCAGTACCTGCCCG 
Sequence 399 

CCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAGATCCGTTTCAGA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

70/467 



MCGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACCTGATCTTATTCT 

CATGTTTAACCTTGGCAGTGGGMGTTCTTCCTGGTATCCTGCCTAATTTACTGGAGTTG 

GCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCACCTGAGAATTTGA 

TAGACATAGACCCAGAGTTACTGAGGCAGGTGCTCTGTTTTGGGGACCAGCAATCGGTGC 

TTTAGCAAGTTCTTTGGGTGATAGGGTTTGGAAACTACTGCTCTMAGCATCATCTGTTT 

TGACTTTGCCATGCACAATCTGAACTCACTCCCGTGAGGCCCTGCTCCTGATACTTTAAA 

TCGTCCTGTCTCTTTTTCTGCCTCTCTGTGGAG 

Sequence 400 

CCGGGCAGGT ACAGGC ACCTATAGMTTTMAGGGGAGATTTCTTTATTTTGTATTCAAT 

GTATTAATMGATTTTTMAACATATTTTGGAGAMTTGCTAATTAGTGTATAATCCTGA 

TGCCAATTCTAAAAAACC I I I I I II 1 I I I I GTAGAGACAGGGTCTTATTCTGTCACCCGG 

GCTGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAA 

TCCTCCCACCTC AGCCT CCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCT 

MCCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCTGGTCTTGAAC 

TCCTAACCTCAAGCAAACATCCCTCCTCGTGCTCCCAAAATGCTNGGATTACCAGCATTA 

AGCCTTACMGCATAAGCTACCATGGACTGGCTTTCNAAAAMTATTTGGTTTAAMTTC 

Sequence 401 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATNNATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGA GTTCT ACTTTAGAGGAMTCCTGGNMCTGTTGTTTGTTTGTAAGTTA 

TAGCTGTMTTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAA 

GGAGTATCTACCA TCATAA AGGTGGTTTAGTCATATAGATAATATCAATCAATAATACAG 

GAGATGGCAAAMTTTTTTGTGAAGAGCCAGATAGTANCTGAGTATGATGACCCCTAATC 

TCAGCACTTTGGGAGGCTGATGGGAGAGGGTCATTTAAGACCAGGAGTTCAAAGACCAGC 

CTGGGCAACATTAAAAMCTCCATTTCTACCAAAAACTTTAAAAAAAATTAGC 

Sequence 402 

GCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGGGAGGACCTAGGCAACGGCC 

TGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAACCTACTGAAGATACGTGGTTT 

TACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGCCATCTGAGACCAGCCATCTGG 

CAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATCCCACTGACACCACGTAGAGCA 

CAAATGAGTT GCCT CCACTGAGCCCAGCCCAAATTGTTATCCTATAAAATCATAAAAACA 

TAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCG 

Sequence 403 

TACTATAGGGCGAATNGNAGCTNCCCGCGGTGGCGGNCGAGGTATTCAACAAGGGCCCTG 

AGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGT 

GTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCA 

GGGAGGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACT 

AAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTT 

TAAAAGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTG 

CCCG 

Sequence 404 

CCGCGGTGGCGGCCGAGGTCAAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGT 

CAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGNACCTTCTGTCACGGAAACCATTGCACC 

CMMTTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTG 

GAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTG 

TCATCAGACTCTGAGAACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCT 

GACTTCATGGAGCAGCTTACACCAATTGAMAATATGCTTTAAATTACCTGGAATTATTC 

CATACTTCTATTGAGCAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 

Sequence 405 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 
GGAAGTTTTC ATTCT CCCTC I I I I I I I I I I 1 1 1 1 I I I I 1 1 AGCAATTCAAGNCATGTTTT 
TGTCTACAAGTTTTTCCAGTATTGTATAGATAAATAATAATTTACTAGGCTGCCTTGAGT 
ATACTAGACAAGAGACCTGCCCG 
Sequence 406 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I 1 1 I I I I 1 1 I I I I I I AC 

CTGAAAATGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATT 

AGACTTAATTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAA 

AGCTGAGAAGTCCMTGCCATTCTGATTCTTGCAACTTACMGTAGTCTTTTTTTGTCTA 

GACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATC 

TTTTGGGTACCT 

Sequence 407 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I I I I I CTCTTATATTGAAGTAAAATTTA 
MATTTAATACTTTTTATTTTTTAAAAGCATGTATGGCATCATTTCAGTCTTATTAAATT 
CTCTCTGCATCCATTCACCCATCCTTCTTTTTGTGTGTGTGTGTAGTGGTCTCTGTGAGA 
1 GGGT TCATTAAT GTCAA TCCTGATCATTTCTTCCTCAAGAGATGTCAGTAGATTTGTTTT 
TTTTGCTTTGGACTTTTATGAATTGATTGMTTTTTATGCCAATTATTTTTAAAGTATTA 
CATAGAAGAACAAATGGACAGAAAAATTTAAATGCAATCAAATCTTGTTGATTTTGAAGT 
ATAGGAAATAATC I I I I I I I I ATTATACTTTAAGTTTTAGGGTACCTGCCCGGGCGGCCG 
CTCTAGAACTAG 
Sequence 408 

CCGCGG TGGC GGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 409 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGAATATCCAATTCATGTGMCTACAGGAAATTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTMTAAMGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 410 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCAGTGCAAATATCTACCCAGT 

TAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTCCT 

CAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTTCT 

TTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGCCT 

CTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 41 1 

ACTT AGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACTTTTTTCATTGATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGNNNTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGTACCTGCCCG 

Sequence 412 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGG 

Sequence 413 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACTTGTGATGCTT 

TTGGCAGGAATTACAGAACAACCAATGCCATTCAAGTTGTGGAGATTATACTNGCAGGTG 

AACTCGTAAAGAGAAGATTCTGGAATGCCTATATCTGAAAGCTTGAGTCGACACCTN 
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Sequence 414 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGAATGAGTGAGTGCAGAACTGGC 

AACCAGAAGACAGGAACAAGGCCTGGGAATGGAGCGGAAAGGTAGCTGCTATATATAGTT 

CCTTCAGCCAGTMCGATTAGAGCCAATAGCCATCTGGATGATGAATGGCTCCTAATTGC 

CTTAAATTACGGCAGTTAGCTAAGGGTTTCTGTTGCTACATGGGTTACCGTAGGCCGCTG 

CACCCTGCATAACTGTCCTCAGGCCTGCGTCCCCTGAGTCTCAGCACTTGGGCCTCCACC 

TGCCCG 

Sequence 41 5 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAA 

TCACAGGAATGAGCCACCACACCCAGCCAAATTGGGCACAAATTTAAAATTTGACTTTTA 

TTAATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAG 

GCAAATCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTC 

TGAAGCGGCCCCNTGGGTCGACTTTGTAACCTGCCCGGGCGGCCGNTCTAGAACTAGTGG 

GATCCC 

Sequence 41 6 

TATGGCGAATTGGAGCTCCCCGCGGTGGCGGGNCGAGGTNAAGCTTCGACCCACGCGTCC 
GATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCNTATT 
CTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 417 

CCGCGGTGGCGGCCGAGGTACTCTTGATGTCATAAGATTAGAAAATGTGGTTAATTGTCA 

TCAACCCATTAAGTTCTTAAATGTCATTGAATGGAGTCCTTGTCATGTTACAGAGGAGCG 

TAAAATTGTGGTTAAACATTTTTTTAAAGATTACATGGTAGAGCCACAGTTTGTTATGCA 

GAAGGAAAATTTAGCAAATATTATTTTGCTTAATAGCCTTTAAAAAATCGTATAAATTTG 

ATTTGTAGTTTTATCCCCAGAGTCATTAGATTTTTCCAAAAAAAAAAAAAAAAAAAGGT 

Sequence 418 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCT 

TAACTTAGATTGTTACTTATGTTTCTAAATCTGNGGAAGCACATTTCCTTTTNTTNTNNT 

TTTCTTTTACTGTTMTATCCTTATTCTCTATTTTACCAGTGGAGAATGNTTAGTATTAA 

TTTCCATTTANCTCANGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAG 

CCTACTGTCTTCTGNTTTGGMGTCCCACMCAGCTCTTTMTTTCCTTAAGCCCCACTT 

TCCTCATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGA 

GCGGACCGCGTGGGTCGAAGCTTGTACCTN 

Sequence 41 9 

CCGCGGTGGCGGCCGCACTTTTTTTGTATTACTTCMCTTTTAAAMTTCTAAAGAAAAC 

CATCATCTCAGACCAGCATTTCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 420 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCT 

CCCTCGTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGC 

AGGTGATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTG 

AACTGGAAAGCACCT 

Sequence 421 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGAAGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

AACCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTNATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGGATTTAATACTTTGG 

Sequence 422 

GGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTCGTCCGGATCCTTCTAACGCAGG 
AGTCTCAGACGCAAATGCCGGCAAGGGCCAGGCAGGTGATGTAAGATGCGTGGAGCAGAT 
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GCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTGGAAAGCACCT 
Sequence 423 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATAC 

AGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGT 

AGAATGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACT 

TTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGT 

AGGATACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAA 

GCTATCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACA 

GAAAATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 424 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGAC 

CCACGCGTCCGATTTGGCAGAAATTCAGGGTAATGTCAAGGTTCTTAAATCTGAGAGAGA 

CMGATCTTCCTTCTTTATGAACAGGCACAGGAAGAAATTACCCGACTTCGACGAGAAAT 

GATGAAAAGCTGTAAGAGTCCTAAATCAACAACGGCACATGCTATTCTCCGGCGAGTGGA 

GACTGAAAGAGATGTANCCTTTACTGATTTACGAAGAATGACCACAGAACGAGATAGTCT 

AAGGGAGAGGCTAAAGATTGCTCAAGAGACAGCATTTAATGAGAAGGCTCACCTGGAACA 

AAGGATAGAGGAGCTGGANGNCCNTCCCGGGGGCCGGGGNCNGCCCGCCCCNNGCAGGGT 

CANATGATTGCAGAATTTATGTGATTCCTGGGGT 

Sequence 425 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACTGAGCCACTTA 

CMTTATT TCTG AAAAATCTCAGAGAACTGAGGATAGATCAGAAAATTTAAAGAAAGCAA 

ATACCAAATTTTCAMCCAGGATAGGAAGTGAATACCTTGTAAATACACTTTGTTAAGTG 

ATGATMTTCTGAGTMAAATTTAGMGATTTTGAGAMAGCATTTGAACTTCTAGGGGC 

CMTAAAATACCATGCAGMGAATGTTTAAAAAGTCATGCCAAATTTGAATCCATTTGAT 

CCTCAACCTCATCAGATGTTATATGCCAAACTACTTATTTTGGCTTAGATAATAATCATA 

TAGAATGAAACTTTCCACAAATAGACTGTGGTCAGTGGCTG 

Sequence 426 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCA 

AMACMGTTTGGTCCCCCTGTTATAGNGCCTGGTAMGGTTTTTGTTGTTGTTTTGCAG 

GGGTGGGGGAACCAGGAAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGT 

AACTGCTACAGCAMGGGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGT 

CGGA 

Sequence 427 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTC 

GTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGNAGGTG 

ATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTG 

GAAAGCACCT 

Sequence 428 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCAC 

GCGTCCGTGATMCTTCTCCTAAGTGCCAGGCATTGTATTACATGTTGGGAGCACAAAGA 

TGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCA 

GGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGCCCTGA 

AATGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCT 

TTACAGCAGCCTCNAAGACTGATCAGGTTACTATACCCTCTCTTNTGTCCACNGTGCATT 

TNAA 

Sequence 429 

CCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCAAAAACAAGTTTGGTCC 

CCCTGTTATAGAGNCTGGTAAAGGTTTTTGTTGTTGTTTTGCAGGGGTGGGGGAACCAGG 

AAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGTAACTGCTACAGCAAAG 

GGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGTCGGA 

Sequence 430 
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CCGGGCAGGTACACTCCAGCTCCTCTATCCCTTGTTCCAGGTGAGCCTTCTCATTAAATG 

CTGTGTCTTGAGCNATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGMGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGMGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTAGGTCGAAGCTTGTACACCTCGGCCGCTCT 

AGAACTAGTGGGATCCCCCGG 

Sequence 431 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGATTTAAAATCTC 

CTCCTCCTACAGCGGTCGAGTATTGAAGCAGGTCTTTGAGGATGGGCNGGAATTAGAGTC 

ACCAAAGGAGGAATACCCTCACAGTTTTCTGCAAGAGTCTCTTGAAACAATGGATGGTGT 

TTATGGGTCTGGGGAAGACCCCNCGCCCCAAATGTTGCTCCCCT 

Sequence 432 

CTTAGGGCGAA TTGGA GCTCC CCGC GGTGGCGGCCGAGGTACACACAAACACACACACAA 
AGTTTAATATACTTTTTAAAAAATTn'ATTGTATTGTT^ 

TGTTGCCTAGGCTGGTCTTGAACTCCTGGGATTAAGCAATCCTCCCAACTAAGCCTTCCA 

AAATGCTGGCATTACAGGTGTGAGCTACCACAATCAGTCTCTTAGATTTTGTTTTTTAAG 

AACA ATTCGAAGTTTACTGCAAAATTGTGAAGAACGAACAGACTGTTCCCACATATCCCT 

TTTTCTTTACACACCGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 433 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGCTTGAGGTGGG 

TTTAGGAAACATTTGGTATCTNTGGCAGGGACAGATGTTGACCTGGCCGGTCGGCAGCTT 

TTACAAACCTAAGGACT TCAGG GTCCGGTTGCGCATGAGGACCGGGGAGGACAGAGCTGT 

TTGCMTAGGTGTGGGCTTTTATAGCATTGTGAGCATTTCACGTTAGCGTAAGTGTTGCT 

GCTGTGCAGGTGGTCTCTGGGGCTTACAATCTTCCCCAATGTTCTTCCCCACCCCTCCCA 

CCATTCTGGTGAACAAGCCTCTTGGGATTCTTTGAAAAAAAAAAAAAAAAAAACCT 

Sequence 434 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATACAGAG 

GTATAATCTGTAACATCAATAATGTAMGTGGGGAAGGGCAAGGTGGAAAAGGAGNNGAA 

TGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACTTTAG 

AATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGTAGGA 

TACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTA 

TCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACAGAAA 

ATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 435 

CAGGTACAGGCACCTATATGMTTTAMCGGGGAAGATTTCTTTATTTTGTATTCAATGT 

ATNMTMGATTNTTAAMCATATTTTGGAGAAATNGCTAATTAGTGTATAATCCTGATG 

CCAATTCTAAAAAACC I I I I I I I I I I I I GNAGAGACAGGGTNTTATTCTGTCACCCGGGC 

TGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAATC 

CTCCCACCTCAGCCTCCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCTAA 

CCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCT 

Sequence 436 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCAACAAGGGCCCTGAGAG 

AGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGTGTGA 

GGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCAGGGA 

GGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACTAAAG 

CCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTTTAAA 

AGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTGCCCG 

Sequence 437 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCACGTAGCAAC 
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ATATGAGTATTTCTCTAGATAACI I I I I I I 1 1 GACAAGGTCTCACTCTGTTGCCCAGGCT 

GGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTTGACCTTCCCTAGCTCAGCTGAAC 

CTCCCATCTCAGGACACCATTGCCTCCACTGCCCATCCTGCATCTGCCTGCCTACCCCAA 

AAGTGTTGAGAATACAAGCATGAGCCAGAGCCACGGAACCTGGCCTCTAGAGAGACTTTC 

TATTTTAGI I 1 1 I ICTTCTCTTATTTGTGAAGCCTTGAAAAAACTACTGTGGTTTATTTA 

GATTCTGGTTTGTGAC I I I I I I AAATAMCTTTTTATTTTGGAATAAATTTATGTTTAGA 

GMTAGTTGCAMCATAATAMGTGAGTTTTCATAAACGCCTTACCAGTTTCCCCTGNTG 

GTTMCATTTTACATCACCATGCTGTTGCA7TGGTCAAAACTA 

Sequence 438 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTGI I 1 1 1 1 1 GTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

TTAGCTAAAAATT 

Sequence 439 

TCGAGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTC 

AAAATNGCAAGCNGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGA 

TTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTNAGCTAATACATAAAGTGCC 

GAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCA 

TTCTACCTAACTCANGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGNCCAC 

AAAAAAACACCTTATGGTCACACTTAAAA 

Sequence 440 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTA 

TAATAGTAATAATAAAAAAAGAGGTGCTTTTCTCCTAAGTCAACATTTTAGAGGAAAAGA 

GTCAATTCAAGCAATTATCACATATGTGTAACTGAAGCACATATGTGTAACTTTTCAAGA 

GTGATTAGATGGTCTGTTGTCTTTGAAGTGATAGTCAAATATCAGGTGTGTTCTAGGGAG 

GTTGTGTAAGACTTTTGCTTGTATTCTCCCGGACGCGTGAGTCGACTCAAGACCTGCCCG 

Sequence 441 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTNTCTAATGTC 
ATATTCTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 442 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCACG 

CGTCCGCTTCAGMTATCCMTTCATGTGMCTACAGGAMTTATAGTTTAGATATTTTT 

AAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTCA 

TATCATTMTAAMGTCAAATTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCA 

TTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 443 

CGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTC 

CAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTT 

TGTATACTTCAAAAGATCTGGTCATGAMTTTTTAGCTAATACATAAAGTGCCGAATTGA 

AATCCAGAATACAATAGCTATGAAGGGCCGNTAGAGTGCAGATAACCAACCCATTCTACC 

TAACTCAGGTTGAGATTGCTTTANAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAA 

CACCTCATGGTCACACTTAAA 

Sequence 444 

ACNGNCAGGTACCAAGATTAAGGACAGAGTTCCCTCCATTGGTCATTGATTTGNAAACCA 

AAATGTATCTGTGACAGGTATTAATCCGGACGCGTGGTCGAAGACGAAAGGACACGAGAA 

ATANGGACCTANNCCGCTCTANAACTAGGNATCCCCNNNNCTGCAGGAATTCGATATCA 
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Sequence 445 

CCGCGGTGGCGGCCCGCCCGGGCAGGTGCCMAGCCTTAACTTAGATTGTTACTTATGTT 

TCTMATCTGTGGAAGCACATTTCCTTTTCTTCTTCTTTTCTTTTACTGTTAATATCC7T 

ATTCTCTATTTTACCAGTGGAGMTGTTTAGTATAATTTCCATTTANCTCAAGATTCAAG 

AMTGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTACTGTCTTCTGCTTTGGAAGT 

CCCACAACAGCTCTTTAATTTTCTTAAGCCCCACTTTCCTCATCANCAAGTNGGTGTGGC 

AATGGATCATAATAGGTTGC 

Sequence 446 

CGGGCAGGTACCCTTATTTTCCCTGATACAGCACAACTCTGCCTATTCTAATCATGACCT 
AGACACATTCAATGAACTACAAGTTCTCCTTTACACGGACGCGTGGGTCGACTCCGGACG 
CGTGGGTCGAAGACCTCGGCCGCCT 
Sequence 447 

CGAGGTCTTCGACCCACGCGTCCGCTTTCTTTTGTTGTTAAGGACCCATGACCTGCAGTT 

TCCCTAACATTCATTTTTATACAGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAAT 

TCATATGCCTTTAATACAATCAGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGG 

AGAGTGAAAGTGGGCCAGCCTAAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTG 

TGGCTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATCATGNTCATAGCTGNTTNCTGGG 

TNAAAATNGNTANTCCGNTCANAATTTTCACACANfCNTANNAACCGTTNCCTTNAATTG 

TTNANN CCNGG GGTNCCNAATTGGGNGNNCTNACTNAAATNAANTNGGGTNGCTTNTNTT 

GNNNNNTTTTTANNTNNGTTTANCCTTNTNNNCANNTTNNTTNTTNAAATTNGNNAAANN 

CCNNGNGGGGAGGGNNNNTTTNNNNNNTNNNGNNANNNTNNNNGNNNNNNNGGNNNNNNN 

TTNATTNNNNTCTNTNNNNNTNNTNNNNNNTANNNNANTTTNNNNTTNNNNNNNNNNGGN 

NNNNNNNNNNNNNTNNANNTTTANNNNAANNNNGGNNNNTNNNNNTTNNANTNNNNTTNG 

NGTNGTTNTNTNNNNNNNANNIMTNTNTTNNNNNNNNNNNNTNTNNNNTTTNNTTTNNNNA 

NGNNNTTNTTN I I I 1 1 I 1 1 

Sequence 448 

GCGGCCGCCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATG 

TGAACTACAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACA 

CMGGGTGTCATGACCAAGCTAGATCTCTTTCATATCATTAATAAAAGTCAAATTTTAAA 

TTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGA 

CCT 

Sequence 449 

CGCGGTGGCGGCCCGACCAGTGCAAATATCTACCCAGTTAGAAGAGTAAATACCATCTTA 

GTGTTATTATCAAAATATTCTGAACTCATGAACCTCCTCAGACTGTTGCTGGGACTCCCA 

GATATCAATACTCTGAGAACCACTGATCTAATGTCTTTAGTCAGTTTCTATTTGTTCTCT 

AGTATAACCAAGCATAAAAGTAATATACTTCCGCCTCTCTCTCCCCCAACTGACTTTAGT 

CAATAGTACCTCGGCCG 

Sequence 450 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTGTCAAGCTTCGACCCACG 

CGTCCGAAAGAGACCCCTGGGCTAGGGAGCCATTTGCAAGTCCCTCAGATGGCCTCAGGT 

GTGGGTCATGCTCCACTGCAGCCATGGTTACTGGGAAGCCAAGTGGACTCCATGGGGCTG 

CAAGCAGGGGAGGGAGCTCCTGGGAGACACAGTGTGGCCCAGATGCCCTGGTCATGGGGA 

CATCCAAGCACAGGTGAGAAACTTGGGCAGGGAGGGTGCAGAGGAAATTTGGAAATGGCT 

GAGAGGCTGGACTCAAGGTCTCTTGGGCTTGAACTGAGCTCAAGCAGTCCACCCACTTCA 

GCTTCCAGGGTGCTGGGATTACAGTCGTGAGCCATGGCATCCGGCTGGACTCAAGGTCTC 

TTGTTCCCTGATCCTGGGCTTGGCATGGAGAAGGGAGGAAGCTGAAGGGGGCAGGTAAAA 

AGAT 

Sequence 451 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 
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TCCGATTATACCCTAAAAAAGTAGAAAGATGGAATAATTAGGAGTAAAATTAGTGAAATG 
GAAAGATATGTTATAGAAAGGACCAAGAAACGCAAAAGTTGGTATTTGAAAAGACTGAAA 
A AATT CATGAACCTGTGAAAACAGTGGTCAAGAAGAAAAGAGAGGCATATTTTGATCTCT 
GTTTTACATGTTACTCAATGTTCATTGCTGCCTCCCTTGTCCATAAAGTGCCTTTAGTGT 
GTATGTTACTTTAGATTATCTTGGTGTCATCAAGCTTTACTCAGCAAAGAACCACTTTGT ' 
TGTCTACTTTAAAACATAAGTTATCTTTAAAAGAATGGGTATCTTTTATAGTTCCATATT 
AATGGCGAAGAAACTGCAGGTAACAGTGCCTTACCAGCTGGGTTTTGCTAACTTTTCTC 
Sequence 452 

CCGCGGTGGCGGCCCGCCCGGGCAGGTTTTATA I I I I I I I CCTCTTTAAAAAATAATTTG 

GTTTTGAATATTAAATTTACATATTTCTAAGTTAAATCAACATTCGTAGAGGAATTATCA 

AAAAAAACTAGTAAGTCTGAAAAAAAAACCATATTTTATATTCTGAGGTCCCGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 453 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACCCAACACAAACTA 

TTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCA 

CCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAA 

AATCAACATTAGCTGGGCGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGA 

GGCAGGAGAATCACTTGAAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATT 

GCACTGCAGCCTGGGCAACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAAT 

AAACACAACAGAATTATTCTGCAAATACAGATATTGGAGTAGCTGAGTTCCATCTCAAAT 

TTGA CTATGCAGGTTGACAGGTGATCTTGGCAAACTACTTATCCTTTCTGAAGTTCAACT 

TTTTCACCAAATGGTATTGGGATACAACACTTGCTCTTGCCTATCTCACATGAATTATCC 

ATTTTGGACAACTTGGTAAACTATA 

Sequence 454 

CCGCGGTGGCGGCCCGAGGTCCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTAAGATCTGAAGCAATCT 

TTAAACTCGTGAGGAAAATTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAAA 

TGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 455 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTACAGCATCCTGATAA 

CAGCCTCTGCCCTGGGAAACAGACTGTGACCATGCATTTCTAGTCCAGCATATCCTATCA 

GAAGACCAAATGGCTTCATCAAAAACAGAGTGACAACCTCTTCTCTTTGCCTCTTCTGTG 

CTTGTTAACAGGCAGCATTGGGGCAGGAGAGCCTGCAGGCCTTTCACGGCTGCTTGAGTT 

CTCACCTGTTTGTCTGAGCTCTGATTCCTCTGCCCTGTAAGCGTAAAGGAGATGTGCTGA 

GTGGAAAGACCTCTAAACAGGCAGCCAGGAAGCCAGATTTCAGGTCCATCTCTGCCTCTA 

ACTGGCAGCTTTGCCTTGGGTAAATCATCAAGTGGGCAATAGTTTCTCTCCTGTAAAAGG 

AAAAGATTGGGTTTMGATTGTTTCTGAAGTTCTCTCTAGATTTAACCTGGMGGAGTTG 

AAATTGCTAACC 

Sequence 456 

CCGCGGTGGCGGCCGCCCGGCAGGTACAACATTTTACATTTCCAGGGACTGCAAAAATGT 

TAGTTCCTTCCCCCATCATTTAGTTTGAAMTTCTTAGATMTTCTTTGCTGGTAAATTC 

CAACAGAATAGTTAGCACACAGGTTCCACACACACMGTTCTAGATAGGAATCTGMGCA 

CCACMTGAAMGAACATTTAACATCTTTTAAAMTGTTTAATGTTATCAGAAAGATGTT 

TGGTATATGTGTTCCATGCATGCTCCTGCTGGTTCTATTTGAAAAAGAAGTTTTTACAGT 

TATCTGTTGTCACCATATTGTAAACGGACGCGTGGGTCGAAGACCT 

Sequence 457 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACAATAAATAGCA 
TTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACACC 
GCTGCCTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTATCAGGCTTCAGTTTCCTC 
ATTCTAAAATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAGTGG 
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CATCTCCCATTTATTTT^ 

ACATATTTTG1TTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAAAGC 

AAAGGATCAAAGCTGGGGGAAAATACTGGACCATGTGCTTCACCTCATGGTGCCAAATAA 

AGAGAAAATGGGGAGAAGATAGGGACAGATAAAGATCTATTTGCTCGGATTGNGCTCTCA 

TCCTTGGCAACATGTTGACAATGCCCTGGAAATA 

Sequence 458 

CCGCGGTGGCGGCCCGAGAGCTCCAGGACGAAGGTATAAACACAGCAGAGGGCAGAGCCT 

GATTTCAATCAGGGGCTACTCTAAGAAAGGCAGGAAACTAGATAAATACATTTAAAAGAA 

ATTCCTCAGTGGCAGGGACAGTAGAGCAGCAGGGGGAGATCCCAGCACGGACAGGTAACA 

GTGTGATGTGGCAGAAAGGCTTTGGTTGCAAGTGGAGAACAGATGTCTCTGGCTGCCTCT 

GGCAGCTGCCTCCTTCTGGGCCTTGACTTTTCAAAGCCAGGCCAGGCCTCCCCCACCCTG 

GACCACCTGTAGCTGGTTCAGAAGGCCCCAGGCTGGGCTTCATAGATGAAGACACAGCTG 

ACTCAAGTCTCTGGCTCTGTCGGCTCTTGGCCACCTTGCCGCTCCCATAACGGTGTTTCT 

CAGGTCAACCCCTCTTCTCCATTCCTACTTCAATGACCTCAGGTCAGGCCCTTGCCACTT 

CTCTTCTGGACAAAGATGACAGCCCTTCACTGGTATCCTCGTCTNCAACCTAATTTATNC 

TTCACAGTGCTGGCAGAAGTGACATCTTTTAAACACACAACGACCCCN 

Sequence 459 

GCGGCCGAGGTCCGCAC I I I I I I I I I I I I I I I I I I ATTTTTACTCC AG AATTTTCCTTTA 

ATATTTAGGACTCCAATCTTTACTTACAAAATAGCTTTTATTTACGTGCACATGATCGTG 

GTTTCAAAATTTTCTAAGCACTATGCTAAATTTGTCATCAAAACATAACAGATTCCCATC 

TTACAAACATAGTTGCTAGTTGAATGAGTAAMGAGATTTCAAATTTCAATTCAAGGAGG 

CATGTCTAAAAGACCAGACCATTCATTTGATGAAATTGTAAATGCCGATCATCCAACTTA 

ACAGGAACTGCACATTTGTTCCTTTCTAGTTAGAAAAAAATAA 

Sequence 460 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTT 

TGTTGATTGACTGATTTATGCCTCTAAGAGGMCTATCTTTTGATAATATTAAATAAGAT 

GTCCTAATACAAAACTGATAGAGTTCAGAAATAATAAGAATCTCCTGGCCAGGCGTGGTG 

GCTCACGCCTTTAATCCCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGA 

GATTGAGACCATCCTGGCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAA 

TTAGCCCGGGTGTGATGGCGACCT 

Sequence 461 

CGCGGTGGCGGCCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 
CAATATAAG GG CAG ATTG AAATATATACGTG AACGTC ACAAAG ACCAATTACTGCCATTT 
CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 
TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 
Sequence 462 

CCGCGGTGGCGGCCCGAGGTGGMTGTCTGTTTTCACAAMTTTTGTATTTTCTCCTAAT 
AGTATGAGGTNGAAGAAATCTACATCTTCTCAAGTGAGCTTATGATTAACTCGATGAGTT 
TTCTTGCTATTCTCAAATCGGAATNTCCAGACCTGGCTAGAAACTAAAGTCTAAGCCCAT 
TCATTAAAGTCTTGAATTTATTTACTTTNGCCMGAACAGCTATATMMTTAGATTCCT 
CCTGGTATAAAATTGGGTGTTTTCTTAGATATTNGCTATCAAMGTCATTTTTCTTGG 
ATCGGACGCGTGGGTNGAAGCTTGTACACCTGCCCGGGCGGGCGGCCGCAC I I I I I I I I I 
I I I I I I I I I I 
Sequence 463 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAG 

CATACAACTGAGGGTTCACGGATTTCTAAGAAATGCATTTCCCTTGTCTATGTTCATCAG 

CCTTTAATACTTTGGCTACAAGGCATATCAGAGAAAGGGAGGTTAAATTGGGTAATGACA 

AAAGAACATATGTAACTCTGGAAATAGGAAAAATGTTCCAGAAATGGGATCAATGTGCCA 

GCAATMGCATAGTTCATTTCATTTGMMTTCAGTTAAAGAGCCCAATAAACAGTTCCA 

AACCGGACGCGTGGGTCGAAGACCT 

Sequence 464 
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ACTATNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACTCAAGCTTTCAG 

ATATAGGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAAC 

TTGMTGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGCATCACAAGTTGTACCTGCCCG 

Sequence 465 

CCGCGGTGGCGGCCGAGGTAATAGCTAATGCATGCAGGGCTTAATACCGAGGTGATGGGA 

TACTTCATGCTAACTGGGTGTTCCTTAGTGTAATAATTGACTGCCTGCTCCCATGTGGCC 

TCACAGAGTCATAGCTTCTTGCCCCAMTGTTTTTCTGGTTTTCTGTGGAGCCATACAGA 

ACCATAAAGAGAAGTTGTTGAGAAACCGGTTTGACTTTCAAGTCTGATGAGGAGCCTGGT 

TTCCATGAGATTTTCAGCTAAGTTAGCATATTTTTAAAAATTTCAAAATGAACAGAGGAA 

AGGTTGACGATCTTTAGAGCCTTCTGCCAGGCTGAAGTGTTCTGCATCTTCCCTAGACCC 

G GTACCTG CCCG 

Sequence 466 

CCGCGGTGGCGGCCGCCCGGNCAGGTTACAAGCTTCGACCCACGCGTCCGGGAAATTTTA 
ATTA AAMT AGGTGAACATTTTAAATGACCTMTACATATTTAGTCCACATTGAAACTTT 
GGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAAATTTGATGCCATTAAAATT^ 
TGAT 

Sequence 467 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTC 

CTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTT 

CACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAG 

GGGGTTGGTGTTGGAGGGTTGGAGGAATGGAAAAGGCCCTGAAATGCAGGCAGAGAAATG 

ATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTTTACAGCAGCCTCGAAGA 

CTGATCAGGTTACTATACCCTCTCTTCTGTCCACGTGCATTTC 

Sequence 468 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTT 

CCCCAAACAGTAATATGGACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTA 

TTTTGATATGGTMTTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGC 

T GTGTA AAAAGACACCTTGTGGTACCTAAMTAGGTTTATGGTACCTATGGAATTGCTTC 

TAT7TTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATG 

CGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 469 

GACCTCTTTAAGTGAAACTTAGTATGCTMTAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATMGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACMCMAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 470 

GCTCC CCGCGGTGGCGGCCGCCCGGGCANGGTTACAAGCTTCGACCCACGCGTCCGGGAA 

ATTTTMTTA AAMT AGGTGAACATTTTMATGACCTMTACATATTTAGTCCACATTGA 

MCTTTGGCATTTTGTCATTGCCATTAAMTTTTC 

AAATTTTGAT 

Sequence 471 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAA 

TAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAG 

CCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTNGGAGGAATGGANAAAGGCCCTGAAA 

TGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTT 

Sequence 472 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCG 

GAGCGTTGCTTGGATTTCTMTTACTTCTMGNGTAGTTTTATTTAATTT^ 

GAAAAAANAAAATAAAANNAAATGTGCGGGCCCGGCCTGCCCGGGCAGGTNCCACNCGTT 
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CGAAAAMGAMGMAAAAAAAAACTTTCTCTTTGCCANTTCTTCTTCTTCTTTNTT 
Sequence 473 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCAC i 1 I 1 I I I I I II I I I I I I i 

TCCTTTTGAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCA 

ATCCTCCCGCTTTCAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGG 

CCCTTTTCTTTTTCATAGGAGMGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAA 

GAATGTGAACTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACT 

GTTTCTTAGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGAGACTGCTTCTCCTGGTTG 

ACCTCATCAATGCCCAACC 

Sequence 474 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAMAGATCTGGTCATGAMTTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

AACCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTCATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGATTTAATACTTTGGAAGTAGCATCATAAG 

Sequence 475 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCNGTGCAAATATCTACCCA 

GTTAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTC 

CTCAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTT 

CTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGC 

CTCTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 476 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAACGCTTCGACCCACGCGTCC 

GGTTTGGGTGGMTTATAATATTTTAGATMGATTTAAGAGGATTGCTAGATNGGAATGC 

GAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCGCTCTAGAACTAGTGGNTC 

Sequence 477 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTTCCC 

CAAACAGTMTATGGACACTGATTTMCAAGACTTATAAAAMATAAGGCACATTTATTT 

TGATATGGTMTTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGCTGT 

GTAAAAAGACACCTTGTGGTACCTAMATAGGTTTATGGTACCTATGGAATTGCTTCTAT 

TTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATGCGG 

ACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 478 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTATACTAGAA 

GATGCTCCAAGGTTTCAGAAAGGAATTMTTACmCAATTTGCACMTTTAGAACAAAT 

A TCTGG CTTTTCCCTMGCTTAATGATTTTCCATTTCACACAACTAAAATATAATAGCAT 

TATTTTATMTCMGTTTMCTGATGGTCTATGATAGTAGAGCGATTTAGTATTTTGACA 

AAAATCTTATGAGACATGAAGTCATTCAAT7TGCCGGACGCGTGGGTCGACTCAAGCTAG 

ACCTN 

Sequence 479 

T TAGG GCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTTT 

CTTTTGTTGTTAAGGGCCCATGACCTGCAGTTTCCCTMCATTCATTTTTATACAGGGCA 

GAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCCAGGCAG 

ATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCTAAAAGG 

ACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACCTGCCCG 

Sequence 480 ' 

ACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC 
CGCTTTCTTTTGTTGTTAAGGGCCCATGACCTGCAGTTTCCCTAACATTCATTTTTATA^ 
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AGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCC 
AGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCT 
AAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACC 
Sequence 481 

GACCTCTTTAAGTGAAACTTAGTATGCTMTAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACMCMCAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 482 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGT 

CAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACA 

AATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAG 

TCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGMTATTTTGATAATAACACTAAGAT 

GGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 483 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGA 

CCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCAC 

AAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACC 

CTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGC 

CCTGAAATGCANGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAA 

TATCTTTACAGCAGCCTCGAAGACTGATCAGGTTACTATACCCTCTNTTCTGTCCACGTG 

CATTTNAAAAACNTTGGCCGNTCTAGAACTAGTG 

Sequence 484 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAA 

TGCATCGAT ATTACA GTTAATTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATAA 

CACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTG 

ATAATTTACATCCTACAGCCCTGCC I I I 1 1 I I M I I I I CAAGTTTGTCCCAGCAAGTCTT 

GGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 

Sequence 485 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I 1 1 I I I I I I ACCTGAAAATGCTTATTCTAGCTT 

CACATTTGATTGTTTGGCTAAGAAGAAMTTATTTATTAGACTTMTT^ 

TTMAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 

CTGATTCTTGCMCTTACAAGTAGTCTTTTTTTGTCTAGACGCTTTCAGGACCTTCTT^ 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 486 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTGGAGGCCCAAG 

TGCTGAGACTCAGGGGACGCAGGCCTGAGGACAGTTATGCAGGGTGCAGCGGCCTACGGT 

AACCCATGTAGCAACAGAAACCCTTAGCTAACTGCCGTAATTTAAGGCAATTAGGAGCCA 

TTCATCATCCAGATGGCTATTGGCTCTAATCGTTACTGGCTGAAGGAACTATATATAGCA 

GCTACCTTTCCGCTCCATTCCCAGGCCTTGTTCCTGTCTTCTGGTTGCCAGTTCTGCACT 

CACTCATTCCGGACGCGTGGGTCGAAGACCT 

Sequence 487 

CTAT NGGGC GAATTNTTNCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACTTTTTTCATTNATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGTTTTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCMTAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGQAAAATGA 

ATTATTAAAATTCCATGGAATTCCTTTTGAAGTTTATGAAGT 

Sequence 488 

CCGGGCAGGTACAAATCAAGTCATTAACATTTTCAATGTCAAAAATACAGCACGCTGTTA 

AGAGTTCTGTCAGTGCTCATTATCCCACTAGATCCCACAAAGGGCAAACTCAAAAGATGA 

AACAAAGGCAACGCCATCAATAACCACCATATTCCACAGGCTTTCTCCCCTAGGACGTAC 
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CTN 

Sequence 489 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTMGTGMACCTAATTCATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTTTGTTTGTAAGTTA 

TAGCTGTAATTMTTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAAGCTATTNTTCA 

AGGGAGNTNTTTACCCATCATAAAGGNGGTTTTTAGTCATTATAGATAATATTCAATCAA 

TTANTACCGGGGGATGGCAAAAA 

Sequence 490 

CCGCGGTGGCGGCCGAGGTGTAATTTGGAGAATATTTAAAGCAAAAGAGCAAACAACAAA 

AACTAAGTTAACACTTACCCAGTGCAGTAAGGGAATTGTAAGATACAGCCTGCTTAAGGA 

GGTCTGCAGACAGATGCACCTMGATTTCAGCTGTTTTAGGTCACTTTTCTCAAAATATT 

TATTATCTGGCAA TGGG GATGGGAGTGGGGAACACCTNTCTGTGAGGCAAATGGTATCTC 

MCAMTACCGACTTTTCAAGGAAGAAAGCTCTCCACTTCTCTCATAAACTTATATACTA 

CCTTAAACAGTATGCAGTATTCGCGGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGC 

CCGCTCTAGAACTAGGTG 

Sequence 491 

CCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACCAAAGCTGATCTCTCA 

TTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAAAAGTTTGAAATTCAT 

TTTCTTCAGGAAAATACCCAGCCAGTCCCTCTCAGTGGAAACCCTGGTTATGTCGTGGGG 

CTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCCGTGTCAGCTCGTA 

GCACAGAAGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCCAGATTACGTGGCCCCTTTT 

GGAAATTCCCAGGCCCAGGGACATGCTGGACTGGGTGCCCATCCACTTNATCACCCAGTC 

ATTCAACAGGGA 

Sequence 492 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGNTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

NATCAGTGGNTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGMTATTTTGATAATMCACTMNATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGA 

ACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGA 

ACTGGATCAGAATGGTCCAAGGACTGTTMACAGAGGMGTATTTACATTTTGAAAACTT 

GCGGACGCGTGGGTCGAAGCTTGTACACCT 

Sequence 494 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTMACAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCT 

Sequence 495 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAAA 

CTTAMGTATMTAAAAAAAAGATTATTTCCTATACTTCAAMTCAACAAGATTTGATTG 

CATTTAMTTTTTCTGTCCATTTGTTCT 

AAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAAATCTACTGACATNTCTTGA 

GGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACAGAGACCACTACACACACAC 

ACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTTAATAAGACTGAAATGATGCCA 

TACATGCTTTTAAAAAATMAAAGTATTAAATTTTAMTTTTACTTCAATATAAGAGAAA 

AAAAAMA 

Sequence 496 
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CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGA 

CCCACGCGTCCGTAGTAATAGGGAATTMCTGACCCCTTTTGGATGGGGGAGAGCATCAG 

GCTGGGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGC 

CCTGACTTGAAACCATTAGCGCT^CTTGCTCTGTTTTGAGAAAMCTTTCCAAACTTTT 

GCATGAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAAT 

TGTCTCTGTGATAAAAAAAAMTGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGA 

AGCATTTTCTAGTGATAGAATGTATTTGTCTTTTTTCCTGGGGGGNACCTCGGCCGCTCT 

AGAACTAGTG 

Sequence 497 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCCAGGTC 
AGCACTTTGAACACTTGAACATCTATGAAATCACATAGTAAAGTGATAGGAGATGGGGCT 
AAGCTTTTAATGGCCTTTAGACATAGCATTAGACATAACCTAAGCTGAAAGGCTTTGGGA 
AGTTGTTGTGTTAMTCCCCMCACACTCTC^ 

AAAAAAAAAAAAAAAAGTTGCGGCCCGCTCTAGAACTAGTTGGATCCCCCGGGCTTGCAG 
GN 

Sequence 498 

CCGCGGTGGCGGCCGAGGTACCCTTTTATAAGGGTGTATCCCCTTTTGGTAACTTACTGT 
TTGTTMTTTGTAGTGTTCCCTGCCAGTAAGCTTGTAACACTCTAGTGACTCACCTTCGG 
GTGGGAGGGTAGGAAAGGGAGAGGCCTGCCTCCTAAACCTGGGAAGATGGGGAGAGAGTG 
GTAAACCTGAGAGCCCAAAAAACAAACCAAAACAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 499 

AGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGT 

CCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTG 

GACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAAAT TAAG TCTAATMATAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 500 

CCGCGG TGGC GGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTMGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 501 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCC 

AGAGAAGCTGAAAGCCTTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGA 

TGTGGCCTCGAGGCTGGGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACA 

GTGATGGCTCAAATCCAGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTT 

TCCCAGGTGTGAAATGGGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCA 

CACAGCGAGCCTNAMACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGAAGTTT 

GCTGTCACTTA 

Sequence 502 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTCTTA 

TGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGT 

GGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGATTG 

GCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAGTGATTTCTTGTGAGATC 

CGGTTGTTTAAATCCANAGGCACCTNCCCCTACCCTCTAGCTCCCATTCCTGCCATGTAA 

GACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTTTTTGAGGCCTCCCCAGA 

AGCTGATGCCAGCCCTATGCTTCCTGCACAGCCTG 

Sequence 503 

CTACTATAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTTTTGAAATGCACGTGG 
ACACGAAGAGAGGGTATAGTAACCTGATCAGTCTTCGAGGCTGCTGTAAAGATATTCATT 
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TMCCTCACCACAGCCCCTGAATTGTTTCATCATTTCTCTGCCTGCATTTCAGGGCCTTT 

TCCATTCCTCCACCCTCCAACACCAACCCCCTACTACGCTGAGGGCTCCTGGAGGGTTCT 

TTTTCTCTCAATCAATTCATCTTTTCTGTGAACCTATTGTTATTATTCATCTTTGTGCTC 

CCAGCATGTAATACAATGCCTGGCACTTAGGAGAAGTTATCACGGACGCGTGGGTCGAAG 

CTTGACCTGCCCGGGCGGC 

Sequence 504 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTC 

CGAAATTCTTGGTTAAGGATTATTATAAAATAGMTGGTATTTCAGTAAATCCCTGAGGC 

TTAGGAGTCCAGGTACAATGTTGGTTCTCAATTAAAATATAAATCATGTCTAGGGACACT 

TAGGAACACAGAACATATATTTAGAGCTAGAAAATATACAGCTTCAGACCAGGCAAAGTG 

CTGGGATTACAGGCGTGAGCCACCACGCTCGTCCTCACATGGGGTTTTATTATTAGGATG 

GTAAGAGTATTATAAGGGATTNGGTACAAGGCATMTGAGTCCTTTTGCTTTTTAGGCTT 

TTGACTTNTGGTTTTMGACTTTTNTTTAGCTTTT^ 

AGGCTTNGGTTTTTTAATAAAGTTTGCTTGGGATNAAACNTGACCTTAATGGAAATTGTC 

CCCTNCCCCCAAAA 

Sequence 505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTGATACAAATAGCATGGATGAAACTCAAA 

ATCATTATTCTAAGAGCCAGATACTATAGCCTGTATTTTATGATTCACTTTCAATGAAAT 

TCTACAATAGACAGAACTATCTATCAACAGAAAGCAGATCAGTGGTTTTCTGCAGCCAGA 

GGTATGAAAGGTTTGAAACATGTGGCACCAGTAGGACATATGGAAACI I I I I I GGTGTGA 

TGGAAGTATTTTTTTATCTTGATTGTGTGGTGTTTGTTATACAGTGGTATACATTTGACC 

T 

Sequence 506 

GGGGCGGCCGCCCGTTCAGGTACACGTNTTTNCCMCCAATTTTATANGNATATATATAT 

TCTACTTCCAACACCCNTNTTCATCCTGGTNCAATCAAAGCCTGGTTNTGGCCAACAANA 

AACTCGTCAGGAGATCGAAGGNTGTAGATGTCTGCACGTGGCTTCCTTGGAGGTCCAGNG 

GNGACTCCCTCTTCCAAAATCCATTCTGTACCCGCTGGCTGCTCTAACGGGCAGGACAAC 

AGCGTATGAAGCCTGACTGCAACTAGGAGAAGTACCACACTCCCGGACGCGTGGGTCGAA 

GCTTGTACACCT 

Sequence 507 

GGCCGAGGNCAAGCTTTTACCCACGCGTTTGAANCCATCTGTTTGGNACCCNGAAAGGGG 

GCAGGAAAGGCTGGGGTCCCAGNCCACCCTAAGGGNATCTGAGTGGCCCAGGGCTNCAAG 

NNNNCCCACCTGNCCAATGGGACCCTTTCTGNCCTCACCCTACAAGGGGCACAAAGGGAA 

GACACCAAACCTGGCAGGAACTTTTCACGCAATCAAGGGAAGGAAAGGCANTCCTGGCAG 

AGGGMCAGCANGCCAAGCGGGAGAAGGCTCAAAGTAAGGAGGGTAAG 

Sequence 508 

CCGCGGTGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGCTTTTCMTTTTATTGTAT 

AGTTTTTGATMTGTTAATGTCTGAGATCTTTATGGGTGAGTCTGCTGTCATTTCTGCTA 

TTTCTCGTAGTGATTTGCTTGTATGGTTTATGA I lilt I AAAAACTGAATGTGTATTAGA 

ATTGTGTCTGGTAATTCTTTAGGGACCCATTGTAGATGTATTTCTTCMAGAGCATTTGT 

GGTTATTATATTTGGGTGCTTGGGGCACTGCCAGTACCTGCCCG 

Sequence 509 

CGGCGGTGGCGGCCGAGGTATTGAACCAGGTCAAAACATTGTTGAATATCAAACCCAATC 

TATTTMTCTGTAAGAAACAAGGACCCTGAGAAAGATTCTGACCAAGGGTATGTGATCGG 

AAACTTGACAGATAAATGTAGTATACTTGTAAAGCCATACTGTGAAAAACTTGGGGATTA 

TTTGAACACAAATTATCACCTGGAAAAAGACAGAAAACAAGGCAGAAGACTGTGCAAAGA 

GGTTGGAATATTCAAAACTTCAGATTAGAAG 

Sequence 510 

CCGCGGTGGCGGCCGAGGTACCCAANNGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGMTGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATC 
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TTTAAACTCGTGAGGAAAATTMGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAA 
ATGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 
Sequence 51 1 

CCGCGGTGGCGGCCGCGACCACGCTTGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCG 

TCCGAMTTAATGAMTGTTTTACATTCTTTTAAAAACCTTTGAAATATGGTGTGTATTT 

TATGCTTTAGCAAATCTCAGTTTGGACCATTTCAGGTGGTCAGCAATTACACATGGCTAG 

MCTMGAGCAATCAGTTTTNTTCCACAGTTTTTCTAAMTTTTCTTGTCAAAAATCTTG 

ATGGTATGA ATtAC TCTTTTAAAAAGTGCACTTNACCAGCAACAGAAAANAACCCTGGAG 

GGGTATGGGTTTTAAAGCTGGTACCTNGGCCCGNTCTAGAACTAGGTG 

Sequence 512 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCA 

CAGGAATGAGCCACCACACCCAGCCAAATTGGGCACAAATTTAAAATTTGACTTTTATTA 

ATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCA 

MTCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTCTGA 

AGCGGACGCGTGGGTCGACTTGTAACCTGCCCGGGCGGCN 

Sequence 51 3 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATAAGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 514 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCACTATAGGGATCT 
AGATCACGAGCGGCCGGCCGCAC I I 1 1 I I I I I 1 1 I I I 1 1 I I I ATTTCTGCCACCTCTTTC 
ACTTGGGAATCTATTTTCACTGCTCTCCAAAGTTTTGAGAAGGCAATAGTCCTGGAAAAT 
GGGTCCTGAGCTCCTCTCAGCAGTCCTGCTTTCTTTCCACCTGCACTGTAAGGNGACCCT 
AACTGGGTCTAAGACAAAAAGTCGGACGCGTGGGTCGAAGCTTGTACACCTT 
Sequence 515 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTAGTTACCACTTCA 

TTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGACTTGAAGCTGGAAGCAAATAC 

AAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCGTCTCCCTAGCTCTCTATGGCTCTAC 

CTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTCTGACTGGGGATAGGACCACT 

GCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTTCCAGGAAATCCCACCCTGGG 

ATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCACCTAATGAGTGCTCTAGAGAT 

CAGCACCAAAGATGCTTGCCAGAGTCTTCTCTA 

Sequence 516 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGGMCCCTTGTTCAGGTTCTTTTTAGGCAACCCAAGCCAAGACAACAAAGTA 

AGATAGAGCCCCAAATGTGGTCGTATAAGGTTTTTCAAAGAAAGTAACACTTGAGTTAGG 

TCTTAAAGTTTNACCTAAGAAACTGCCCAGGTGGACAAGAAGAAAGGGTGTTCCAAGTAG 

AAATAATANCATGGACAAAGGCAATGTAGCAGGAAAAGTNTTCGTAAATTCAGGGAATTT 

CAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTTGCGGTGGCAGGGGATGTTG 

GAAATGTAAACMGAGTGAGATACAGMGATTTTATGTGGCATGCCAACTGGGAC7TTTT 

TTTGTAAACAA 

Sequence 517 

TCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGCTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAAT 

GTTCTAGGGNGGGNGAGGAAANCTAGGGTGGNTATCTAMTCAACAAATATTCTAGATAT 

TCCAATATCTAAATTATTGTTGGAAATACTCNTCCTGAAGNGNTCATTTGAACNCTAAAG 

CAGGAGNACAGCNTTTGTTGTATCAANATGGGCAGGGGTTTTTAAAGGGTNTCC A I I I I I 

TNTTANTTTCCNCATTATTAAATTCCNTTNTMATNNTTTTTAGGACCAAAAATTTTTCC 

CNTTTCTTNGAGGTNTTTAAAGGGGGATTTAANAAAATGGGNNANNTGGGGGGTTT 
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Sequence 518 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGTAGAGAAAGATTCAA 

AATGCTGCTGTTCTACCTGAGATGGGAAAAATGAAAGCAAAATAACATCAACAAAAACAA 

ACAAACAGCCTTGTAGTTCCATGTCACTAGCCAGGGATTTAAGACCAGCCTAGAGAACAT 

GGTGAGACCCCTTCTCTACAAAAATAAAAATAAAAATACAAAATAAGCTGGACATGGTGG 

TGTGTACCTGCCCG 

Sequence 519 

A7TGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGC7TCGACCCACGCGTCCGCTCACT 

TCATCCTCCCAGCAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTG 

GGGCTTAAGGAAATTAAAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTC 

AGAAATTTGATAAAAATAGCACTTTGCATTTNTTGAATCTTGAGCTAAATGGAAATTAAT 

ACTAAACATTCTNCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAG 

AAGAAAAGGAAATGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGGTTAAGGC 

TTTTGGCACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGGGAT 

Sequence 520 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGAGCTATGGACCTAAGGCAGCGAGTGGATTCATTAGTCCTCTTTCAGCTGA 

ATGCATGCTACAGTATAAGAAAAAAGCTGCTGCCTATATGAAGTCTTTGAGAMGGTTTG 

TTAGCTGCTGTTAATATTTAAATCAGAGGAAACATCAGGAGTCATTCTAGAGAATGGCAA 

GAG I III ICTGCAGTTTATATTGTTGACTTTTTATACGATATTGGGGTACCTCGGCCGCT 

CTAGAACTAGT 

Sequence 521 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGC 

TAGGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAAGACTGGCCAGCATTAGAGGTGGGTTG 

GCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCG 

TAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 522 

CGACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCA C I I I 1 1 1 I I I 1 1 I I I I 

TTTTAAGTAAAGAAGGTTTTATAATATAGTGAAAACAATACGGAGATGAAAACCAGGAGA 

CCTGGGTCCCGCCTTTGTTACAAATGCCTTTCCTAAAAGCTCCAGAATGGTGCGAGGTCA 

AAACAGATGGGCAGAAAGGAAGTGGTCATCAGAGCAAGAGAAAGAGCAGGTGCCAGGCAC 

TCACGTGTGCGGTCATATCAGGTAGAGATGATGAGTAGAGATCTGCCCTAGAAGACACTG 

MTTCTGAGATTCAAAGGGGAAAAGTTGATTTTATAGCCAGTGATTTTATAGCCCACTTT 

CCTGCCCCACCCCTACTNTAAGAATTGCGGACGCGTGGGTCGAAGCTTGTACCTGCCCG 

Sequence 523 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCAGAATA 

ACTAAATAAATAAAAGGCTAAAGAAAACTGGAACAGTACTGCGTCTCCATCTGAGACGCA 

NTCTTCTACTTCCAGCATCGNAGAGAAGGGCTAGGGACAAI I I I I I I I CAAAAGATTTAT 

ATACAGGCTTGAATCCAGAAATTAAGGNTAAAAGCATAAATATTGATAATTTCAACTAAA 

TTCAGAATGGNTTCAGAAAGATATGATACAACAATTTAGAATAAAACAAAGCAGAAGAGC 

ATNATATTTTGCGGACGCGTGGGTCGAAGACCT 

Sequence 524 

CCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGTCCGCTTAAAAGATTTTTTT 

TTTATGTAAACTGTTGAATATTTGAAATAGTCCACTTCACCTTAATGGGTCTTGTCTATC 

TTCATTAGTCTTCAAAGAAAAACCATTTGCTACCAAAGTAAATCAGTATTTTGAATGTGC 

TTCTCTTG I I I I I IGTTTATTAGCTAGTTCCTGTAAGCATTTCCACCAGAACTTGAGGCA 

MTCGTMGGMGCTGTTTCTTTTAAAACACAAACCACCACCAAAAATTTAAATGTACCT 

GCCCG 

Sequence 525 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCAAGACCCGCCTGGC 
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CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 

TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGGAGGAGAATCACTTGAACCTGGAGGCA 

GAGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACT 

CTGTCTCAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGCC 

T7TTCTGGTCTTTGAACACATCATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACAT 

GAACTATAAACGGGTAGACTAACTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGC 

TGAATT 

Sequence 526 

CCGCGGTGGCGGCCGAGGTACCAAAACCCGGCTTT7TTCGAAATACCTGCAAAAAAAAGT 

GGATGATTCCAAATCCAACTGAAGTGTCTGCTCTCTCTCCAATTCAGAACAACCAGAAGG 

GCCTGTCTTGAATTAGGTAATGCATTAAAGAAAAGTAGGATTATTATATTCCAATTCTTT 

CCATCAGATGTAAAACATTATTGGTAGATCACATCTGTTTTAATAAATCTGTAAGAAAGA 

CGTGTMTTATAATTATGTTACCATTGTATGTAAATGGCATTTTAACAAGACATATTAAT 

ACATTTTTATAGAGTACCTGCCCG 

Sequence 527 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGT 

CCGGTTAATTTGGGAGAAGGAAGAGAGAGTGACATATTTGGCTACCTTCAGGGAACAAAA 

TCTMCAGCACAGATGGTAGTAGAGGAGATACCAATTTACATATTAAGGAGCTAGAGTTG 

ATGATGGTATGACTCAGCCCTCTGAGATTAAATTCTACTTACTAGGGCTATGAATGGAGA 

TAAGTAGGTATCCCACCTTTTATTAGAAGGTTCCTTAAAATAAATATGGGACTCTGGTCA 

GAGAGTAGGGCCATTMTTTGCTCCTGGTTTTTACCTGGCATCCACCCACCAGTACCTGC 

CCGGGGGGCGGNCGGCCCGCCCCGGGCAGGTCCCGCAC I I I I II I I I I 1 1 1 1 I I ICCTTT 

Sequence 528 

CCGCGGTGGCGGCCGCCCGGGCAGGTATTGCCCTTTGATGTCCCCATGAGGGCCAGGCCC 

AGGCAGMCCCATCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAG 

GATTAATATGGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTC 

TCTTAGAGTTCAATGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGGAATA 

TCTAAATATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAACATTTAGT 

GGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGTGCAAA 

TGGTATTTCTCAGCTAAAATCTCCCTTATGAACCCTTCTCGAAATCC 

Sequence 529 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAACA 

AGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGCT 

TGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTTT 

TGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAATT 

CAI I I I I IGAAAGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATTG 

CAAACATGTCATCACTCCCTTGATGTTTACCTCCTTGTTATGCATTTTTAGCAGGCTTTA 

TTGTCACCTGAGA I I I I I I I I I ICTTTGACAGGCCGGAGTCTAGATGAAGGAAAATGTGT 

TAGAAGCACCTTATCCACAGATGGGG 

Sequence 530 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAGCAGAGCAGCTCCCTC 

GCTGCGATCTATTGAAAAGCTTGAGTCGACCCACGCGTCCGCTGCATAAAAGTTATGCAA 

AMGCATTTTATGATATACCAGCAAAAAACATGGAAAATGAAATTTTGAAAAGCAATGCC 

ACTTCAAAGATCCCTCAAGTGCCTAGAGGGAGAAAATGAGTTAATATGCTTTGAAGAACT 

GTATCC AGAAAA TAAAATTACAAAGGAGGAGAGGGATAGGATTCCAGGACAATCTCAAAA 

CTATTGCTTTTTCCTAMTTCATTGCAACCTTAAAATCCTAGCAAGTTCTTTAATGTAAA 

TTAACAAGCTAATTCTAGAATTCATATGCATATTCAAAAGTCGAATAATTGTCAAGGCTA 

TCCTGTAGAATGGGACAGAGAGGATTGAAATTT 

Sequence 531 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC7TTAATCCCGTC7TAC 
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AGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAGCC 

AAGATCTTGACCTACGCCATCTGATACCCTGAGCCCATGCTATAAAAGAGGAGCATTAGA 

MTATTTGAAAGATAGAAATGAGAACTAGTCAATATTTATTTTGCTTAGCACTGTATTCA 

GTATTATGGCATCTTAAAGTAGTTAAGACTCAATATTTCATCAAAAAAGTTTAAAATCTA 

ATCAGAGAAT 

Sequence 532 

CGCTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATAAGCCCACCC 

CACTAGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTAC 

TGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGG 

TTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAAC 

CCGTAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATAC 

CCCCTTGCCCCGCCAACGTAGT 

Sequence 533 

CTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCACTTTTT 

1 1 1 I 1 1 I I I GTAGAGACAGGATCTCCTTATGTTGCCCAGGCTAGACTTGAACTCCTGGGC 

TCAAGGGATCCTCCTGCCTTGGCCTCCAAAAGTGCTGGGATTATAGGTGTAAACCAGTGT 

GCCTAGCCTACAGTTTTTTMTTTTATAAMTGTTATTTCTMTTTTTCTCCAAAAGTAA 

AAGTGGCATTCCAATGGCAATATTAATTCAGGTATCCAGAACTCTTAACCTAAATTTGGG 

TGAGATGAGGAAAAGTGTATTGTTAATTTTATGTGTCAACTT 

Sequence 534 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCC 

ACGCGTCCGTAGTAATAGGAATTAACTGACCCCTTTTGGATGGGGGAGAGCATCAGGCTG 

GGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGCCCTG 

ACTTGAAACCATTAGCGCTMCTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTTGCAT 

GAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAATTGTC 

TCTGTGATAAAAAAAAAAAAATGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGAA 

GCATTTTCTAGTGATAGAATGTATTTGTCTTTTTTCCTGGTGGTACCT 

Sequence 535 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCACGGCCCAGGCAGAACCCA 
TCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAGGATTAATATGGG 
AATAAAAAATGAACCTTAAA 
Sequence 536 

GAANTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCAGTAGATTTGGAGAGTAATACAAATC 

CTTTCTTTCTGGTTAGAACACACTGCCAAAAGCCACCTCTTTCATCTAAGGAAAAGATTA 

AAAATGCATGTTGATATCTCCTAACTATCACACAACTTCCACTATTACAATGAAAAATCT 

G GTCCC CTTTCATTGCCTTTGAAAACCNTTTTGCCGAGGTGGNTTTCAAAAAAACNCGNG 

ANTTTTNAAAAANTTG GNTTTGGTTTTACCNGGGGAAAGGGGACITrTTNNCNNTTTTTTT 

1 1 1 I I I I H I I I II M I NAAANGGNGATTNGGTTNNGGTTNTNCCTGGGGCCAAAATNCC 

NTTT TGNGGMCCTTTTTTGGGGTCCNAAAANNNACAAAANAAAGGGNTTGGGACNATNT 

TTTTGNATNCNCNCNAAAAAAA I I I I I I 1 1 1 1 I 

Sequence 537 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGCTA 

GGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTGAC 

ACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGGCC 

ATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGTAA 

GCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 538 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCG 
ACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCA 
CAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAAC 
CCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGNGGTGGGAGGGAATGGAAA 
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AGGCCCTGA 
Sequence 539 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGACAAGCTTCGACCCACGCGTCCGCAAGT 

TTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCAC 

CAGTAGGTTGGACAGCATATMTTTGCATCATTTTGTCCCTTGTAAATCAAGATGTTCTG 

CAGATTATTCCTTTAA 

Sequence 540 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAAC 

M GCAGC CAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGC 

TTGT TTTC CTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTT 

TTG CTTTTA GCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAAT 

TCATTTTITGAMGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATT 

GCAAACATGTCATCACTCCCTT 

Sequence 541 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGA 

TTGTTAAAAGAGGAGTCTAGAAAAATTAATCCTGAACCCTAAAGAATAAATCTTAAGTGG 

TGGATACATGGGTTGAATAGTGTGCTCCAAAATTCACATCCACTTGAAACTTCAGAGAGT 

GGCCATATTTGTAAATAAGGTATTTGCGGGTGTAATCAGTTAAGGATCTCAAGATAAATT 

CATCCTGAATTATAAGTTGTCCTTAAATCCAATTACTGGTATCCTTACAAGAAGGTGAGA 

GGAGACAGAATAGAGCCATCTGAAAAGGGTCAGAAA 

Sequence 542 

CTAACACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 
CCACGCGTCCGCAAA AATC AATCAAGGGTTCCTACTCAAGTAAAAAGCAACTTGTAGGAA 
AATAATAGGGGATATATTTTGCTCATTAAGGATCTTTTTATAGTGGCTCTTGGTGCACTG 
CCTGTGAGTTAGCCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAAAAAAAAAANGT 
Sequence 543 

CTACTATAGG GCGAT TGGANCCTCCCCGCGGTGGCGGCCGAGGTACTTCCTGGAAATCAA 

TTAACTGAGTCTTTTGAAACCCCTAGAGAAGATAGGAGAAAATTGGTTCAGANCGAGCAT 

TTAAATTAAGTCAGCAMGTCAGAATTTAAAATTGGGCAATTCCTTGTCTACATTTTCTT 

TACACTCAA 

Sequence 544 

CTNACTATAGGGCGANTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCC 
T TAAC T TAGA TCGTTACTNATGTTTCTAAATCANGTGGAAGCACATTTCCTTTTCTTCTT 
CTTTTCTTTTACTGNNMTATCCTNATTCTNTATTTTACCAGTGGNGAATGTTTAGAATT 
MTTTCCATTTAGCTCMGATTCAAGAAATGCAMGTGCTATTTTTATCAMTTTCTGAA 

AGCCTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCC 
Sequence 545 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCACACAGAAGGAGG 

AGGGAGCTAATCCAGTAACAAACATTCAAAGATTAAATTGTAGATATGCACCTCTGTATT 

TGGCACTGTTGATTAATATTATAACACCTTCCTCTCAAAGACAGGCATTCTTAAGCGTTA 

GTCACMTATACCAGMTTTGCTATTCATATTAAAACCACCTTTTAAACTTTATAACAGT 

AACCAATTATTATAGTTTTAAGAAACAAAACGCAATGAGAACTGGGAATGGAATTCAAAT 

CCTC CAAATTCTTGCTATGCTCCAAGCTGCCATCCATAAAACAGGTTTAATTTGGGTAAT 

TTTTCCATTGTGGGGMGGGTCMCAAGMACAATTTAAAGACAATATTTTCCAATACAA 

ATAAAGACATACACTTTTTGTT 

Sequence 546 

TACTTAGGGCGATTGGNANNTCNCCGCGGNGGCGGCCGCCCGGGCAGGTACAAGCTTCGA 

CCCACGCGTCCGMATAATAAAGCTAGAAGTAATATTTTTCCTTTTGTCTATTTTCCAAA 

TTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTATTTAAGNTTAAAGGGAATATTTA 

TTGATCACCTCTATGTGCTCANTACCT 

Sequence 547 
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TACTATAGGGCNATT6GAGCTCCCCGCGGTGGCGGCCGCAC I I I 1 I I I I 1 1 1 I I I 1 I 1 11 

TTCCAGTGCTAAAACATCAGATAAGAGCCTACCTGACATTTTGGAGAATTTGCTGNGCTG 

GGATTGATATTCCGCATTGCCTAAGAGTAAAAAATAAGACGGACGCGTGGGTCGAAGCTT 

GACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCACAGGAGGCAGAAGGAAATCCTCA 

ACCTTCCGAAGAAGGCGTAAGCCAGGAAGCAGAAGGAAACCCCAGAGGAGGGCCGAATCA 

GCCTGGGCAGGGATTTAAAGAGGACACACCCGTTAGGCATTTGGACCCTGAAGAAATGAT 

AAGAGGAGTAGATGAGCTTGA 

Sequence 548 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNAGGGTCGCGGTGGGTCGA 

CTNANGCTAGAGAATTGTAATACGACTACTATAGGGATCTAGATCACGAGC 

Sequence 549 

ctacttagggcgaattggagctccccgcggtggcggccgagg 1 1 1 i 1 11 1 1 i 11 i 1 11 1 1 

taccagmcatcacataagtttatttcagatgtmcagcaatgttaaaattgacaagttt 

mttcttaactgcaccaagtamcttagccatttaagtatttttttaag™ 

aaaaaactgagggagcttttcttttccaccaccacaccatggtttcccaatagttctctt 

tttggaggacttttcmttgatgagtamctgctttagatatttcagaacttc^ 

aaatgmagctaatctggacamctatatattgcatagatttctctacagattctttgct 

TTAAAA 
Sequence 550 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGATGAGGCGGGGA 
GGTGGGACCCCCAAACATATATCAGCCCAACAGCCCTAAGTCTCCTTCTTTATTATTAGG 
AAAACAACAACAACAACAAACAAAAAAATGGCGTCATGAATATGAACAGCATTGTCAGAT 
GAATTAGTTGAAGTGGN I I I I I I I IG I I I I I J I I I I I I I I I IGAACCTGCCCG 
Sequence 551 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGTGCTTCGCCCCAC 

GCGTCCGNAATAATTGGAAANGGCCTATAGATTAAAAAGCTGAGAAAGTATATGGTAGGG 

AGCACACTCCCCACAAGTATGAACTCTGNGATTACGACATCTCATAAATNCATGAGCACT 

CATGTTGGCTTGCTTTGTAGCTATGAACTTACCCTGTATTATTGAAACGTCAGCATAATG 

ACTGGMGGAGAAATTGGT CCATT TTAGAGCATTACTATTATGCTATCTGTCCATTTAAA 

TTAATMTTGCATTAMTTCATTTTAGAAGGNGCTATTACATTNGTAGTAAGAMGTAAA 

TTCATATATAMTATTTGATTATCAGATGGTTTACTTACAGATACTTATTTTCCTGTAAA 

ATAGGAGAGTTTACCTGAAGAAAAATAAAACTTTTNACTTTTCTGGGAAAAAA 

Sequence 552 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACCAAGTGAA 

TTTAMTMTTGGTGTGGATTGGCCAGTAGCTMGMGTGGGCTTTTAAAGAGTNTTGAA 

NATNGMNGGGTTTTTNTTTCTTTTTTAAA 

TGA AAAGC TAGGNNTTGCCCTNTTCATGTNTACTCTCCTTCCAAATAGTTATATCCAAAA 
CTGTTTTTCCCTCTCCCCTACCTTGTCCCCCCTATTMMTANAAACNGGGATTGATTAA 
TGTCCCGCTCCTGAATACATGTAAAATTTGTACCTCGGCCGNTCTAAAACTAG 
Sequence 553 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC . 

CGCTTGTCAATAAAAAACAAACAATTGCTAAATAAAACAACCTGAGAAAATCTCCAGAGA 

ACTATACTGAGTGAAGGAAGAAAAATCCCCAAAGATTACACACTGTATGTCATTTATATA 

ACATTCTTGAAATGACACAATCACAGAAATAGAGAATACTGGTCACTANTGCATTAAGGA 

AGGTGTGGAAGGATGTAGTGATGGGAGGAAATGTGTATGGCTGTAACAGGGCAACAGAGG 

CNTCATTGTGATGATGGAAGTGTTCTGTNTCTTGGG I I I I I I GAATGTCA 

Sequence 554 

CACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAG 
AACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGNCCAACCTACTGGTG 
AACTGGATCAGMTGGTCCMGGACTGTTAAACAGAGGAAGTNTTTACATTTTGAAAAC 
Sequence 555 
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CTACTTAGGGCGAATTGNANCTCCCCGCGGGGGCGGCCGCTAAAGGAATAATCTGCAGAA 

CATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAA 

CTGGATCAGAATGGTCCAAGGACTGTTAAACAGAGGAAGTNTTTACATTTTGAAAACTTG 

CGGACGCGTGGGTCGAAGCTTGTACACCTT 

Sequence 556 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I 1 1 I I I I I I I I I 1 1 I I I G 

TTTTGGTTTG I I I I I IGGGCTCTCAGGTTTACCACTCTCTCCCCATCTTCCCAGGTTTAG 

GAGGCAGGCCTCTNCCTTTCCTACCCTCCCACCCGAAGGTGAGTCACTAGAGTGTTACAA 

GCTTACTGGCAGGGAACACTACAAATTAACAAACAGTAAGTTACCAAAAGGGGATACACC 

CTTATAAAAGGGTACCTN 

Sequence 557 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCACGCGT 

CGGCTTTCAATAGATCGCAGCGAGGGAGCTGCTCTGCTACGTCACAATCTTTCAAAAAAA 

TGAACATGTAAGAAAAAGCAGTTTTCATTGTGCTAATTATTGCAGGCCTTCATGCACGTA 

AACCTCAACAAAATGTGTGCCAACAATATACAAATTTCCATATAAACAAAGTCATTGATG 

ACTAACAAAATATAAACATGGNTTCTTTTATATTAGA I I I I I I I I AAAAAAAAGCTATTT 

ACCAGCAAGAAAAAACAAGTACCTGCCCG 

Sequence 558 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATT 

GATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTAT 

TAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGG 

TGTTTTTTTGTGGACCATTTTAAANCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGT 

TAGG 

Sequence 559 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGGCTTCGACCCACGCGT 

CCGCTCCAGGAGACTTCTGCTTACCTCTCAGTGATCAAAAACCGTTTCACCACAGTTACT 

TACCACGTCCTACCGATCCGCATTCTCGCAAGTGTCCTTCACTCCATTTACTCTACTGCA 

TTTTTCACTGTATTTCTCATGCCAAAACTTGGGCTTCTCCACCAGTCTGCACACGTTCAT 

GCTCTCAATTCTCACAGCCATCTATTTCATTCTCCACTAAACTGTTAGAGGGATTTCTGN 

AGAAATTAAAGAAATTCCTATCACTCCTAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAG 

AACTAG 

Sequence 560 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACG 

CGTCCGCAAGTTTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTG 

ATCCAGTTCACCAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATC 

AAGATGTTCTGCAGATTATTCCTTTAA 

Sequence 561 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTTATTTATTTCTT 

TTAGGAATTGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTA 

GGCAGGAACATTAGACTCCAAAAGCAGAGAAATGCTTAATTTTTCTTCTACCTGTTTCAC 

CACATTCATGTANAACTGTAGTAAAAAAGATGGNGAATCAGGCTGAATCAATCTAAATAA 

CAACTTAAGGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTT 

TAAATGGNGCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTG 

TCATTATGACATTCCATGAAAATGAAGAAAATCTTTCANGAAAAATTTAGAAAATAAAAA 

TGTTTACT 

Sequence 562 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTTAACT 
TAGATTGTTACTTATGTTTCTAAATCTGNGGAAGCACATTTCCTTTTCTTCTTCTTTTCT 
TTTACTGNTAATATCCTTATTCTCTATTTTACCAGTGGAGAATGTTTAGTATTAATTTCC 
ATTTAGCTCAAGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTAC 
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TGTCTTCTGCTTTGGMGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTTTCCTC 
ATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGAGCGGA 
CGCGTGGGTCGAAGCTTGTACCT 
Sequence 563 

CTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGTAATCACATA 
AATTCTGCAATCATCTGTTTATTTAGCTTAACTG I I I I I I I I I I I ATTTGTTGAAGTTGT 
TGTTGTTATTNCAGTCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAA 
GTTCCATAAT 
Sequence 564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTCTTCAGATTCAAAT 

GCCAACACTAATTTGAACTTCTTTGGGTCTATGACAGTTTGCAAGCCATACAAACCCAAA 

GAGCTAATCTGTGATTTCTTAACTTGAGAAAATAATAATNATAACCACCACTGGAACCTA 

CATAGGTTTGTNGNTTATTTAACATGACTTMCCTTTTGTTTGTATTTTTTTGAAAAAAA 

AAAAAAAAAAGTGACCTGCCCGGGGCGGCCGGACGGCCGGGCAGGTCGCNCTCAAATTNT 

TNAATTTNTTTTGGAAAGACANGNATTTTTTATTTTTGCCAANGCTAAAACTTNACNCTG 

GGCCTTTAAAGGGGATTCCNTNCTGGCCTTTGGGCCNCCAAAAAGTGCTTGGGATTNTTN 

GGGTNNAAACCCCGGNGGGGCCCTAGCCTACCAGTTTTTTTAAA 

Sequence 565 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 

GTGCAAATTTGACTTTGTAAATGGCCCTTGGGCTCTGGGAGGAAAGCAACTGTTGGGCCA 

TGTGGTTGTATCTTTAGTTTTGTAAAGAATTGCCAAACTGTTTTATAATGTGGGTATATC 

TTTCCACACTTCC AGCACAA TGTATGAGTGATCCAGTTTCTTAGCACCATAGTCAGAATT 

TACTGTTGCTACTATTTTTTAGCTATCCTGATAGATGTGTAGTGATATTTTATTCTGGTT 

TTGAAGCAGTGTCATTGTCTGGGGTAAATCCTTGAGGTTTGTTGTCTCAGTCAAGGGGAA 

TCAAGGGACATGGACACACAAGTAGTGAATTTAAGAGTGGAAGTTTAATAGGTGA 

Sequence 566 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGAC 

CCACGCGTCCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCA 

AATTCTCCAAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAA 

AAAAAAAGT 

Sequence 567 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGCAGGTCAAGCTTCGAC 

CCACGCGTCCGCATATTTTCATTACTTGACAATGTGGGATGGGTGCMTTTATTCCATCT 

T CACTA AAATAGAAGCAATTCCATAGGTACCATAAACCTATTTTAGGTACCACAAGGTGT 

CTTTTTACACAGCTCATTTGMTACAGGTGTTCTGAGMGGGGTTTCTATTTTAAAATTA 

CCATATCAAAATAAATGTGCCTTATTTTTTTATAAGNCTTGTTAAATCAGTGTCCATATT 

ACTGTTTGGGGAAGG 

Sequence 568 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCAGGTACAAATAAGCCCACCCCACT 

AGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGT 

AAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNNAGAGAAAAAAAGGCCATCTATACCAATTCTAAG 

Sequence 569 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAGCGGCCGNC 

CGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTNAAAATTCCAAG 

CTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGTA 

TACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGNGCCGAATTGAAATC 

CAGAATACAATAGCTNTGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAAC 
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CTCATGGGCACACTTAAAATAGTTTGCATCCCACACTACAGGCTTCTCTGGAGGGATTTA 
ATACTTTGGG 
Sequence 570 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTATTTTCCCTCAGTAAC 

ATGTMTTGCTACATTTTTTATAAGAAGGTATGGTTAGAAAAAAATGTGAAAGATCACTT 

AAACCAAAGCCAGTTACAAGGAGTAATCTCTCCTGTTGGTTTACCTTCACCTCANAACTA 

CMGMTATTACAATACATAGTGAATAGTTGTCTGTMCATTTCTACCAGTTGTTTCANT 

AGCATATTGGTCTTGGCATTTCTTGGCACTGTGGTTCTGCTGTATTATTTGTGATGTCTT 

ATTGTTTGTG AG CTTTTGTTTTTTTTT AA AG AA A AAAC AA AAACTA AGTG 

Sequence 571 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGTAATCACATA 

AATTCTGCAATCATCTGTTTATTTAGCTTAACTGN I I I I I I I I ATTTGNTGAAGTTGTTG 

TGTGTTATTTCAGNCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAAG 

NTCCATMTTATGGGGMGATTTCCTCTGAATTGGCTAAATTCCTGTAGCTGAAAAAAAA 

AAAAAAAAAAAAGT 

Sequence 572 

TTAGGGC NATT GGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT 
CCGCAAGTTTTCAMATGTAMTACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATC 
CAGTTCACCAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATCAAG 
ATGTTCTG CAGATTATTCCTTTAA 
Sequence 573 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTACGGC 
TTCTGACAGGCCTCAGAAG AACATTCC CTACCCCAAAATTATAAAAATAATCTTGTATAT 
ATTCT TCTCMAACT7TATACTTTTTTTAAGGCTTGGATTTTTAATCTATCTGGAATGTA 
TTTTTAMTACTGAGTGAGTCACTTTTCTCCCGGACGCGTGGGTCGAAGACCT 
Sequence 574 

GAATTGGAGCTCCCCGNGGTGGGGCCGCTCGAGGCCGCTCTGACCTCTTTAAGGNANACT 

TATTATGCTMTAGTTGATGCGCCCTTTTGCTGNANCAGTTACCATAGGTTACATGATAA 

NTATATATTGTTGNGATGATCTGATTNCCTGNNTNCCCCACCCNTGCAAAACAACAACAA 

AMCCTTTACCAGGCTNTATAACANGGGGACCAAACTTGNTTTTGCTCATCATTGCCGGA 

CG 

Sequence 575 

AGGTA CTTACCCCAGACAACGACGCCGCTTCACCATGATGATGGACAACAGGCAACTTTT 

TTTTTGGAGTTTCA GCTTG CTTCCMCAGGGACGGTGAGTGTGAGGTTTATTCCCATTTC 

TAAGACGATAGAAGTTTTCAGCCTAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAAGT 

TTTGTCTTGGAAATTCTCCTTAATGACTAAAAGTTAAAGAATTAAACAACCTAGCTGGGC 

TTAAATTTCTTNCTTACCCATTAGAAGTACCCTTGCCC 

Sequence 576 

TAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGNGGTCTAAGAGACAGGGTCTCACTACA 

TTGCCTAGATTGGNCTCACACTCCTGGGCTCAAGCAATCTTCCTCTCTTGGCCTCCCAAA 

GTGTTGGGATTGCAGGTGTGCGCCACTAGGCCCAGCTTGAAMATTTTTAATGCATGTGG 

TMTCCACAGGAGATCACATTTAGTATATGACCAAGTTAATTAAGAAGNCAAAAAACACG 

TTMATTTAAGCAGMTAAGGCTGGGTTCGGTGGCTCATGTTAGTTTTTATCCTTAAAAT 

TGTCTGAGTTCTTAGAACACAGAAAAMCAMTTTGAATGCATTTCTAACAGCTTAATAA 

TTTATATGTCCCATTATG 

Sequence 577 

GGGGNGGCGGCCGCCCGGGCAGGTCAATGCTTCGACCCACGCGTCCGATTTCAGGTTGAC 
TTTTCTCACCTTTAACCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCAT 
CTGGGTTATCTGTAGCTTGAGTTGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGACCT 
Sequence 578 
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AGGTGTCGACTCMGCTTTCAGATATAGGCATTCCAGAATCTTCTCTTTACGAGTTCACC 
TGCTAGTATAATCTCCACMCTTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGC 
ATCACAAGTTGTACCTGCCCG 
Sequence 579 

CCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATGTGAACTA 
CAGGAMTTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGT 
GTCATGACCMGCTAGATCTCTTTACCATATCATTAATAAMGTCAMTTTTAAATTTGT 
GCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 
Sequence 580 

GTGAACCTGCCCGGGCGGCCGGCCGCAC I 1 I I I I I I I I I I I I I I I I I I CCTTTTGAGACA 

CAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCCCGCTTT 

CAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTTCTTTTT 

CATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTGAACTCA 

AGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTTAGCCTT 

GTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTC 

Sequence 581 

CCGGGCAGGTACCCTAAAACTTAAAGTATAATAAAAMMGATTATTTCCTATACTTCAA 
AATCAACAAGATTTGATTGCATTTAAA I I I I I CTGTCCATTTGTTCTTCTATGTAATACT 
TTA AAA AT AATTG G C AT AAAAATTC A ATC AATTCATA AAAGTCC AAAG C AA AAAA AAC A A 
ATCTACTGACATCTCTTGAGGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACA 
GAGACCACTACACACACACACACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTT 
AATAAGACTGAAATGATGCCATACATGCTTTTAMAAATAAAAAGTATTAAATnTAA 
Sequence 582 

CCGGGCAGGTACTTATACTAGMGATGCTCCAAGGTTTCAGAAAGGAATTMTTACTTTC 

AATTTGCACMTTTAGAACAMTATCTGGCTTTTCCCTAAGCTTAATGATTTTCCATT^ 

ACACMCTAMATATAATAGCATTATTTTATAATCAAGTTTAACTGATGGTCTATGATAG 

TAGAGCGATTTAGTATTTTGACAAAAATCTTATGAGACATGAAGTCATTCMTTTGCCGG 

ACGCGTGGGTCGACTCAAGCTAGACCT 

Sequence 583 

AGGTCGCCATCACACCCGGCTAA I I I I I I I IGTAI I I I I AGTAGAGACAGGGTTTCACCA 

TGCTAGCCAGGATGGTCTCAATCTCCTGATCTCGTGATCCGCCCACCTCAGCCTTCCAAA 

GTGCTGGGATTAMGGCGTGAGCCACCACGCCTGGCCAGGAGATTCTTAATTATTTCTGA 

ACTCTATCAGTTTTGTATTAGGACATCTTATTTMTATTATCAAAAGATAGTTCCTCTTA 

GAGGCATAAATCAGTCAATCAACAAACAATAGGCAATCACGGACGCGTGGGTCGAAGACC 

TGCCCG 

Sequence 584 

AGGTGTACMGCTTCGACCCACGCGTCCGCATTTTTCTGGTGTTCCCTCTTACGTGCACA 

CCCCTTGCTCCCCTTTGGGTTGACTTATAATCTGACTTTTGTGACAGATGTTAGGAGGTG 

GAGCAAAGGAATTTCAGACCAATCAGTTAAGAGACTGCTGTGGGGTAAGAAAAAAAATTA 

GCCTCTTAAAATTACTCTTATCAAAGGAAAAAAAGTTGGAAGCACATGATAGTATAACCA 

GAAACATGACACAGAAGAATTAAGGGAAGAACCTGCCCG 

Sequence 585 

CCGGGCAGGTGGAMGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTMGTTATAGCTGTMTTMTTT 

TCCCTGTATTCAAAGCCCCCMACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAGTCATATAGATAATATCAATCAATAAT 

Sequence 586 

CCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTTTGTTGATTGACTGATTT 
ATGCCTCTMGAGGAACTATCTTTTGATAATATTAAATAAGATGTCCTAATACAAAACTG 
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ATAGAGTTCAGAAATAATTAAGAATCTCCTGGCCAGGCGTGGTGGCTCACGCCTTTAATC 

CCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGAGATTGAGACCATCCTG 

GCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAATTAGCCGGGTGTGATG 

GCGACCT 

Sequence 587 

AGGTACATTGTTAGACMGTGTTTATCACTAATCTGGAATACATCATCTTCAATAAGGCT 

CTTGTTTTCCTCCMGCTGCACTGCTCACACTGCTCAGTTTTCTGTTAAGCAACCTGCTC 

ATTATAGTAGAGCACCMGGTGATCTGTTCTTCTGTTCTTCAGAAGTTCACTATTTCTTG 

TTGCAACAGGGCTACATGATTTTAAGATTCCTCAAAGTCAATACGAATTAACATTATTTT 

CCATTTCCATTCTGTATATCTTCACATTCCATAAATATAATACTCATGTATACGTTAAAT 

TTCCTTATAAGTTCAACACATTGAAAGCTAAAATAAAGACTTCCTACTAG 

Sequence 588 

CCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTCCAA 

GCTGACCTAGTGCMCACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGT 

ATACTTCAAAAG ATCTG GTCATGAAATTTTTAG CTAATACATAAAGTGCCG AATTG AAAT 

CCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAA 

CTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAACAC 

CTCATGGTCACACTTAAAATAGTTTGCATCCCACACTA 

Sequence 589 

CCGGGCAGGTGACTTGGCTGTGAAAAGTGCTAAAACAGATAAAAGACTATACTGACAGGC 

AAATGGAGCCTGTTATGACACTGACATTGAAGGTGAAAGGAGAATCCAGTTCACATTAGC 

CAGGGTCTCAGGGACCAGGTTTTGAGGCAGTATTTCTGCCTCTTGAGGACAGGGCAGAGC 

AGGTGGGTAAAAAGCAAAGAGACCAGGGAAGGGGGACTAAAAGTAAGGGAAACAGCATCT 

GAGGMAGGCTCCTCTGACTGGATTTTCACAAACATTATTTATTAACCTCACTAAACAAG 

GATAATGGGACAAAACAGGGGCAAGCTGGAAAACCAGCAGGGGTATTTGGCAG 

Sequence 590 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTTGGTCACTGNCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGTTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGAC 

Sequence 591 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTAACTGTTGTTTGTTTGTAAGTTATAGCTGTMTTAATTT 

TCCCTGTATTCAMGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTG GTTTAG 

Sequence 592 

AGGTCAAGCTTCGACCCACGCGTCCGCAGCCTGGGTGATAGTGAGATCCTGTCTTAAAAT 
GAAGAAAGAAAGAAAAAAAGAATGAGAAGGAAGGATATTAATTGAAGTAAGAGCACATTT 
GATTACAAAATAGAAGAGGAGTAAGTGAGAACTAAACGGGGAATACAGATAGCAGAGATT 
AAATAGGCTATAAGAAAAAAAAGGGATGATAATAAGACCATGGTAGTACCTGCCCG 
Sequence 593 

AGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGTTTA 

CAATATGGTGACAACAGATAACTGTAAAAACTTCTTTTTCAAATAGAACCAGCAGGAGCA 

TGCATGG MCAC ATATACCAAACATCTTTCTGATMCATTAMCATTTTTAAAAGATGTT 

MATGTTCTTTTCATTGNGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTGTGTGGAAC 

CTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAATTTTCAAAC 

TMATGATGGGGGMGGAACTAACATTTTTGCAGNCCCTGGAAATGTAAAATGTTGTACC 
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Sequence 594 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAATAGC 

ATTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACAC 

CGCTGCCTTCTGACGGTAAATGTATTGTAGAATTCATGTTGTTATCAGGCTTCAGTTTCC 

TCATCTCTAMATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAG 

TGGCATCTCCCATTTATTTTTCATTTGAMTAMACTTTTGAATTTTATCTTCTACC^ 

ATAACATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAA 

AGCAAAGGATCAAAGCTG 

Sequence 595 

CTATAGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTT 

MCTTAGATTGTTACTTATGTTTCTAAATCTGTGGAAGCACATTTCCTTTTCTTCTTCT^ 

TTCTTTTACTGTTAATATCCTTATTCTCT 

TTCCATTTAGCTCAAGATTCMGAAATGCANAGTGCTATTTTTATCAAATTTCT^ 
CTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTTT 
CCTCATCAGCAAGTTGGTGTGGCANTGGATCATANTAGGTTGCTGGGAGGATGAAGTGA 
Sequence 596 

ATGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 
TTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCATATTCT 
TCCAAAAAAAAAAAAAAAAAAAGT 
Sequence 597 

CCGGGCAGGTCTTCGACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

AAAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTGGGATAATATGG 

CTCAC TTTTTACTTCATTTACAATATTTAATAAGTGCGATTTTAGACTTGAGAAGAGAAT 

ATTTTCTGCTAAAATTATCCCCACTAGAGATAATCACCAGTGAATTAATACACTGCAGCA 

ACGGAACCAGTCAGCT'I I I I I GGTAATCATTCCCTTCCT 

Sequence 598 

CCGGGCAGGTCTTCNACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

MAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTAGGGATAATATG 

GC TCACT TTTTACTNCATTTACAATATTCMTAAAGTGCGATTTTAGACTTGAGANGAGA 

ATATTTTCTNCTNAAATTTATCCCCNCTAGAGATAATNNACCAGTGAATTNATACACTGC 

ACCNNACGGAAACCAGTCA 

Sequence 599 

AGGTGCTTCG ACCCAC GCGTCCGGTATTTC^^ 

TAGATGCTTTCTTTTTTCCCTTGCCATAATAGCTGGCTTGTAGAGAGAGTTATGTTTGAA 
AAGGCTTGCCI I I I I ICCGTCGCTCTG 
Sequence 600 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

TGTGGTGCTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACC 

TTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTTGAGTATCTGGAGGAGGATGCCCA 

GAAGTCCNCACAGGAGGGGGTGCTTGGGACCACACACTGATGCTCTTGNCATTCAGACTC 

TGAGAAACATGCCGCGTGATGAANAAACCATCCCAATTANANGAAGCTAGCTGNNTTNCA 

TTGNAGCAGCTTTACCCCAATTT 

Sequence 601 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATT 

GTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAAT 

TGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAG 

GGTGGAGGAATGGAAAAGGNCCTGAAATGCANGCAGAGAAATGATGAAACAATTCCNGGG 

GCTGCGGNGAGGTTANATGAATATCTTTACAGCAGCCTNGAAGACTGATCANGTTACTAT 
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ACCCTCTCTTCTGTCCACGTGCATTTNA 
Sequence 602 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATTTTMGTGTGACCATGAGGTGTTTTTTTGTO 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

T 

Sequence 603 

ATTGGAGCCTCCCCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAG 

ATCCGTTTCAGAAACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACC 

TGATCJTATTCTCATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAAT 

TTACTGGAGTTGGCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCAC 

CTGAGMTTTGATAGACATAGACCCAGAGTTACTGAGGGCAGGTGCTCTGTTTTGGGGAC 

CAGCMTCGGTGCTTTAGCAAGTNCTTTGGGTGATAGGGGTTNTGGAAACTACTGCTCTA 

AAGCATNATCTGTTTTTGAC 

Sequence 604 

GGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGTC 

CTGAAAGCGTNTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTGG 

ACTTCTCAGCTTTCCTCATTAGMGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGGA 

MATT MGT CTAATAAATANTTTTCTTCTTAGCCCAAACAATCAAATGTGAAGCTAGAAT 

AAG C ATTTTC AG GT A AA AAA A AA AAAAA AA AAGTG CG G C 

Sequence 605 

CCGGGCAGGTACANNTTGTGATGATTTTGGCAGCAATTACAGAACAACCAAGGCCATTCA 

AGTTGTGGAGATTATACTAGCAGGTGAACTCGTAAAGAGAAGATTCTGGAATGCCTATAT 

CTGAMTCAGAATCCTAGTAGTTTGTAGTTTGCCTCTTCCTAGAAGTTCAAGAGACTCAA 

GTCATAGGCTACAGATGTACCTN 

Sequence 606 

AGGTCTTCGACCCACGCGTCCGCAACTGTTGATCTMCTTTTCCACTTGAATGTCTAATT 
GGCAAATCAAACCTAACATGTTCCAAACGAGTTCTGAAGCACCCCCTCTGCCAAATCTAC 
GTCTCCCACAGCCTTCCCTATTTCTCTACCTGGTACCTGCCCGGGCGGCCGCTCGACCTG 
CCCG 

Sequence 607 

AGGTCTTGAGTCGACCCACGCGTCCGGAGATGTATACGCCACTATAGGAACTATAAGAAA 

AAGTCAAATGGAAATCTTATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGA 

TAAGGTGTCAGTGAACTCAAAAATCAATCAATAGAAACTACTCAAACTAAAACTCAAAGA 

GMAAMAMGATGGGAGATAATTATTTTTTAAGMTTGGTCATCAAAATGTAGCAACAA 

GTTTGCCTTATCCTATATCATTTGAATTTTCMMAATAAGCTCATTATACAATCTTTAA 

MTATTTTGAATAGAACTGTTTCATGTGTTATTTGT 

Sequence 608 

AGGTCAAGCTTCGACCCACGCGTCCGGGGMCCCTTGTTCAGGTTCTTTTTAGGCAACCC 

AAGCCAAGACMCAMGTMGATAGAGCCCCMATGTGGTCGTATAAGGTTTTTCAAAGA 

AAGTMCACTTGAGTTAGGTCTTAAAGTTTCACCTAAGAAACTGCCCAGGTGGACAAGAA 

GAAAGGGTGTTCCAAGTAGAAATAATAGCATGGACAAAGGCAATGTAGCAGGAAAAGTCT 

TCGTAMTTCAGGGAATTTCAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTT 

GCGGTGGCAGGGGATGTTGGAAATGTAAACAAGAGTGAGATAC 

Sequence 609 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

GTGGTACTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACCT 

TTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTGGAGGAGGATGCCCAGA 
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AGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTGTCATCAGACTCTGAGA 
ACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCTGACTTCATGGAGCAGC 
TTACACCAATTGAA 
Sequence 610 

AC I I I I I I I I I I I I I I I AGCTTGAGTCGACCCACGCGTCCGGGGATCTAGATCACGAGCG 

GCCGGCCGCCCGGGCAGGTACGGAAGCCATGCACTTGCCTCTCCTTCAGAGCTGGGATTT 

TTTTTCATTTTGCTGGCTGTGAGCACACACACGCCCACAGGTGCCTAAGCCTCTTGTATG 

TGTGTTTTGAACTGTGTCCTCTGAGTTCTGTGTCTGGGTGCATGCTCTCCTCTTAGCGTG 

GGTCTGCTTCCCCTGTGTAGCACTTCACAATGTTAGGCATTTGTCTGTGATAGCAGCTGT 

TCAGTAATTTCCTACTT 

Sequence 61 1 

AGGTTTCGACCCACGCGTCCGGAATTTATCTGGCCAGGCATTGGTAGTTTACAGAAGTCT 

ACCAGATGATTCTAATGTGTGGTCAAGACTGAGAACTATGTGTTTAATTGGGTTCATTTC 

AAGAATACTGTAAAAATTTTATCTAAATACTAAATATCCATAAAAGAAACCTCGGTAATC 

AGGCCAGGTTTTTGAGTTTTTCCAGATTAGCCCAACTACAGGGGAAAGAGACTTTCGCAC 

TATATCCCAGAGTCTCTGCTCCTGCTTCCAGCCTCAATGCACTGGGCCTTTCTGCTGCCT 

TGGAGCACTTAGAGGGATTACAGGAGGAGTGATCTGTGGAGTT 

Sequence 612 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGCCCGGGCAGGTACATACAGCCTGAAG 

TTAACCTTTCTATGTTAMTGMMMCTTTGTCATTCATCAGGTCACAGAAACCAAAAA 

CTAAACAATGCAAAAAAAAAAAAATCTAAAAATAAMGAATTTTATATTTGAAGTTATTC 

TGGATATTCGCACCATTTTAGCTTCTGAAAAAAATGCAACTATGAAATGAAGACCTCATA 

TATTTTCATTTATCAATATAATGTTAAAAGTTTCATTCCACCGGGTGTGGNGGCTCACAC 

TTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGATCGAGA 

GTATCCTGGCTAACATGGTGAAACCCCGTCTCTCTAAAAAAATTCNNNAAAANAAAAAAA 

AAGGAA 

Sequence 61 3 

CGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGTAGTTTTCTTAGCCATTTTTC 

MTAGT CTACACAGTGTTTATGTTTCCTTTTATTTGTGTATAGTGGAGTAGAGGGGAGGT 

TTTTTTATTCAAATAGAAGAAGCTAAAACTCAAATGCAATGTCAGATCTCANAATAAACT 

GACCCAATTTCTGAMCCCAATAAACACATTTCATTTGTMTATTCTTTATTATATAGCT 

CTATGAAAAGTAATTTGTGACTTTCGATCTTAAAAGAGAGTTTTAAAAATACACAGTAAA 

TTGAAAGAAAMCTACTACATTTAAAACAGTATTTTCCTGAAAACATAGAATGAAAATGC 

MGTATTTTGTGCATGGCAGCTGTTTTTMGGAACCAATGTTATATATGGNGAATTTTGT 

GGMGACTATGTCTCTTAAMTATTTCTTATMAATANCATGGCTTTTTAATAGCTGGGA 

ATCTGANGNNGGATTTCCCATGAAGACCTTAAATGGCTNNGCAGGAATTATAAAAAAG 

Sequence 614 

ACTTAGGGCGAATTGGAGCTGCCCGCGGTGGCGGCCGAGGTACATGCCTGTAATCCCAGC 

TGTTGGGGAAGCTGAGATGGGAGGATTGCTTGAGCCTGGGATACTGATATTGAGGCTGCA 

GCGGGCCGAGAATACCACTGCACTCCAGCCTGGATGACAGAGTGAGACACTGGCTCAAAA 

MAMAAGAAAAAGGAAGAMAAAGTTTAMTCMTGAATGTTCTCATTTCTAATGAAAT 

AATGAAA CATTATT GGGAGAGTTATAGTCATAATCATCTTACTGCACTATCAATTAAAAA 

ATACATCATTTTTTAGAGCACMTATATACCATAMGMTT^ATTCAAATAGTCTAAATAT 

TACGATCAAATTTTTMTAGACTTO 

TCAGCTCAAGTTGGGATTCACACCTGTAATCCCAACACCTAGGGGGGC 
Sequence 615 

CCGCGGTGGCGGCCGAGGTACACTGTGTMGTGGTCAMGATAGACATGGTTTTATTTAC 

AAGGAAATTTGCTGAAGTGTAAATTATAACACGAAGAGATGGGAGGGAGGGGTAAACACC 

TAAATGTCTAACAACAGAGAATGGTTCTCTGTTGATACAAAATTATGATACATCAAAAAA 

GMCAACAATCAAATTCTCTGAGAATTCCCATTACAGTTAAAAGGAGCTCCCAGCCAGGT 

GCGGTGGCTCACGCCTGTAATCACAGCACTTTGGGAGGCCGAGTCGGGTGGATCACGAAG 
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TCAAGAGATTGAGACTACCCTGGCCAACACTGTAAAACCCCGTNTCTACTAAAATACAAA 

AATTAGCTGGGCCGTGG 

Sequence 616 

TTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACTGAATAACTGCCAATGCCA 

TCTGCCTGTGGCCTTCTCAAGTTTGTCTGCACCTGTGGTTATCCTGACTTCAAACCCGGG 

GAGACAGAGGCTAGAAGAGGCAGACAGCTCTTGTGTATTCTCCTGTCCAGTGCAAAGAAC 

ATCTGGAACTCTGAGCCCTAACCTTAAATGCAAGACCTNATCTGCAGGTGTTCCTNATCC 

TTTTAGCCCCTCAGTGATGTAAGCAACAAACGTCACCCANCTCCTGGGGCACACTTNACT 

CCCAGATGAGCTTGTCCTGGATTTGCAGGGAGCCTGGCTCCC 

Sequence 617 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACMGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCMTTACCAGA 

GGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGC 

Sequence 618 

GCCGGGCAGGTACAGATGGGGTTNCACCGTGTTAGCCAGGATGGTCTCGANTTCCTGACC 

TCATNANGCATCCANCTCGGCCTCCCAAANTGCTGGAAATTACAAGGGCGTTGAGCCCAC 

CCG CACCTGGGC CAGAATCTTACATATTTCTTAAACATCATTAAATATATATTGTATTTT 

TTACI I I I I I I I IGAATAGGGGTCTTGCTATGTTGCCCAGGCTTGGTTTTGAACTCCTGG 

CCTTNANGAGATCCTCCCGCTCTCAAACCTCTCAAAAGCAATGGGTA 

Sequence 619 

AGGTACCCCATTTTATGCCATAAGTCAGGTTTCTCCCTCAATAGCCCTTTGGAACTCTCA 

AGGTCCAGAGTGGCATCAAACCAACTGACACATGAGTTGATACATCATGTGCTGCCAACA 

GAGAAATTAGTCTGTGCCAAACTCAGCACAATCCTGCAGTTCAAACCAGAATTTCAAAAA 

Sequence 620 

ACCAAGATT TGM TCATGCTTTCAAMGCTAATGTGMGTTAGACATATTTGGTTTCATA 

ATCACAGAATTTTAAAAACACCAGGTCTGCAATATTCAGAAATCACCATTAACGCTCTCT 

TGACACATACAATCMTTTCACTTTAGATCGCTGATTTTCTTMCAACTGATTTAGTTAT 

TTCTGAATACTGCTAGAAAATTCAAAATCTACAATTAAT 

Sequence 621 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCAAGGCATTGTTCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTNTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATNAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGMTTTCACCATCACCAACCTACCATATTCCCAGGACAAAAGCCCAGCC 

AGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

Sequence 622 

NCCGGGCAGGTACTGGATGACAGCAAGTGCACACATCAAGAGAAAGTTACCATTCAGAGG 

TGCAGTGAGTTCCCTTGTCCACAGNGGAAATCTGGAGACTGGTCAGAGGTAAGATGGGAG 

GGCTGTTATTTCCCCTAGGTCATCTCTTACATTCTAGTTCTGGTGCTCTCTATCTGTTTA 

AGACAMCCCTTGNGCACCTTTCTCCCACCCCTCCCTTTCTCCCTTGTCTCCCTTGAGAA 

AACAACTNCAGTTCTCTGCCTGCACCATGACTGTCGATACGCGGGGGCAGTTCGGCGGTC 

CCGCGGGTCTGTCTCTTGCTTCA 

Sequence 623 

AGGTACAAGCTGTGCACTGCAAGGTAACCACGTGGCCAGAGGCACATCCCTCCCTCACAT 




WO 01/070979 



PCMJS01/09126 



TABLE 1 

100/467 



ATACTGAGTGGTGTAATGCAGTCACCTTGTCATCTGGCAAGAGGTGATCGATGGACACAA 

ACTCCTCCCGGAACTGCCCCTCCAGCGAGCTCACTCTGAGGTTATCTGAACTCACATAGC 

TTGGGAAACCCAGCTGGGCACGGGCAACATTTGCGTAGTGACCCTTCCAGTCATCGGAGC 

ACATGGTCTTCCACGAAGCAGCTGTGAACACCTGGAGCACGGCATTCTGACCACTCACCC 

GGACACAGCGGTACCTGCCCGG 

Sequence 624 

CGGGCAGGTACTTTGCAAGACACGCCTGGCTACGAACAACATGGGACAATGGGCAGCCTC 

GCTGCACTGNACAGAGGAAAGGAAAGAGGCCTTGCAGCCACTGCCTGGGAAGGAGCAGCA 

CATTCTGCATTAACCAGGCATGCCTCACTCACTGCAATCCCCAAACAAGCCCAACTCTCC 

GTGTTGATTATTCTTACCATACTCCACCAGMAGCAGCATGATTTTCTGTCCTCAAATAC 

TTCAGATTCCAAGAGAACTGCACCTTCTAGAGTCTCTACTGATAACCTCAGNCACTTACC 

CACTTGAAGCATNAGCACACACTTAAAAAGGAAA 

Sequence 625 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCTTCCAACCTTTCT 

TCTGTGCATMTCATCTAGGTGTGGTGCTTACATTTTCTTTTGGCAGTGTTATCTTAGTA 

TCTTCCAGCATGGTTTTCTCACCTGATACTGTAACCATACTTCCATATCCTCAAATGTGT 

TGTTTTCTAAAATAAC I I I 1 I I I I I CI I I I I I I AGAGACGGAGTCTCACTTGGCCAGGTG 

CGGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGT 

CAGGAGTTCGATGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGC 

GCACGCCTGTGATCCCAGCTACTCGGGAGGCTGACGCAGGAGAATCTCTTGAACTTGGGA 

GGTGGAGGTCGCAGTGAGCCGTGATCGCGCCGCTGCACTGCAGCCTGGGTGACAGAGTGA 

GACTCTGTCTCAAAAAAAAAAAAAA 

Sequence 626 

AGGTACGCGGGACATACTCCCTAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATG 

CCGGGAGGCTTCCTGGAGCAGGTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATA 

AAGCATACTGGGAGCTGAGCTGCTGTTTAGTMTTAMAATCCTTTCCATTGTTTAGAGC 

TCAGCACCTTTGTGCATTCATATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCAC 

TTCTGCTACTGCAATGTATGTCTACAGCTGACAAGTCTTCCTTGGGAGCCCTACGTAGCT 

C I I I I I I I 1 CTTTTCTTTC I I I I I II I I I I GAGACGGAATCTTGCTCTGTCACCCAGGCT 

GGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTC 

TCCTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGGTGCCCACCACCACGCCTGGNTAC 

I I I I I I I IGGTAT 

Sequence 627 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTCTGCCATGAAGGT 

TCTGGGGTGGAGAGGGAAGCAATGTATATCCTACCCATGGTGATTGGTCCGATGGAAGTC 

ACATCCTGATGGGAAAAAAGGACTGAGCCAGAGTGGACTGTCTAAACCAAAATGGGATAA 

ACAAGCATGGCATGGAGCCAAAACAAATGGCTAAGTCAGAGGTCCTAATGCAGAAGGCTG 

GACAACTAGGATGGTGGGGAAAGACATGAGCTTGAAGGACTTCCCAAGATAAAGCAGAAC 

TMCCAGAAGAGCCTGTTATAGATTCATATTGGGGGAGTTTGGGGGGTTTGTGCAGGGTG 

CATCAAAAAGCACTCGCATGGAATAAACATATCTTGCACAGGAACATATGACAGGTAATT 

GAATAGTTTGATTTGAACTATGTAAAGACATGATCCTGATGGTAGAAGGATGGTACCTGC 

CCG 

Sequence 628 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I I I II I I I I I I I M I I I 

TTGGGACCGAGTCTCACTCTGTCGCCAGGCTAGAGTGCCATGGCGCAATCTTGGTTCACT 

GCAACCTCCACCTGCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCCCTNACCACCACGCCCAGCTAATTTTTGTATTTAGNAAAGATGGGGTTTCAC 

TGTGTTGGCCAGGCTGGTCTCGATCTTTTGACCTTGAMCTTTNACATAMCTTTCACAT 

TTCCATGACAAAGTTTTAGCAGTAACTTCCAAATTGGTCTTATTCAACTCCAACATTAAA 

CTTTGT ATGT ACCTG CCCG 

Sequence 629 
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CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGATCC 

AGGATGTGATGGGATCTTAGGGCTTGGCTGGAAGGTTTCTCCAGTCAGCCATCTAGCAGA 

GCTGCAGATCTGGGCTGGGCTGTTGGCTAAAGTGCTCTTCACAGACACCTCATTCGGCTC 

TTCCTTCAGCTTCTTCACTTATTTCTTACTCAGTCACTACTCAGCTCCTTGTCCATGTGT 

CCTTGAAGCCATCCTAGGTCTTATTCTGATTCTGAATTCTTCAGTCACCCATAAGCTTCT 

CCTTACCCCGGGAGTCAGTGGGTGTGTGTTCCCAGGTGGACTTAACCATTCTTCTCCTTT 

ATGATCCTTTCCCTTGGGTGGACAAGTGTGATTTGGTTGTAAGGCCATTTTTCAAGTTGC 

CTATACATTGATAAAAGAAATCCCACTAACGGAAGTAGACTGCATGCCAAATTTCAGTGT 

CTTTCTCCAGGGGCCAAGGTTGGACCCANAAGTGCATGGG 

Sequence 630 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTGCTTCCTGGGAGAGCTGACCATGA 

GTCAATTGGCCCACAATAANTTATNAAATGAAAACCGGCCATCATCTGCATCTTATGAGT 

GCACGTCATCAGAGATGTCCACTCCAGTTACAAGAAAGTCCTGAGGGCTTTCTTGGAGCC 

TGANGGGCGCTGGAGGTGAGACCTGGAGGTGAGCAGGAGTTAACTAGGATGAGGGACNGG 

CGCAGCATACAGGAAAAGCTGCCTGGGGGAGAAAGGACCAACAGCAAAGACTGAGAAAAA 

AATGCTGTTGTGACCAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCAT 

GGAGGACTGAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCATGGAGG 

Sequence 631 

CCGGGCAGGGTACTAAGGACAAAAAGACATTTATTCTCTTTGACCCTTGCTGCCAGNACA 

GAAAATGACTTCACCCAAGGACACAGCACTTGCGGGTGGCCTTCTCCACCTCCAGCTATT 

GCTTGGTTTCAGGTGACCACTCCCTTTCTCTTCTCAGGCCTATGGGTGGTAACAAGCTCC 

CATC CACT GCTAGTCTTAGACATCTTTACTTTCCTTGATTGATNCCCTTGACTCTGCCCA 

CATCTTTTAAAATATCCCATATTAAACTTTTTACACCCCTTTGAATGTGTCCTGCTTCCT 

GCTGGGACCATGACTAGTCTCTTCTAGTNGGAATCCATATCACCTTCTGTGATGTAGTCT 

CCAAGTCAGGCAGNCTCATTTCAACTACAGNCTTTCTTTATGCTTCTCTTTTTCCTTTCT 

GGACTCTTACCTTTCTTTTATTTCTTACTCAGCAACAGTGTCTGCCCATTAAAATGCACC 

TTTGCGGNGGNGGTTNGGGTATCTTTATCTCTCTTATTTCTTCCTTNCTGGTGCATTTTC 

ACTGGCATTGCATGGGAATTTTGGTT 

Sequence 632 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACATACTCCC 

TAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATGCCGGGAGGCTTCCTGGAGCAG 

GTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATAAAGCATACTGGGAGCTGAGCT 

GCTGTTTAGTA ATTAA AAATCCTTTCCATTGTTTAGAGCTCAGCACCTTTGTGCATTCAT 

ATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCACTTCTGCTACTGCAATGTATGT 

CTACAGCT GACAAGTCTTCCTTGGGAGCCCTACGTAGCTC I I I I I I I I CTTTTCTTTCTT 

l 1 1 1 1 1 I l IGAGACGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTC 

GGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGT 

AGCTGGGACTACAGGTGCCCACCACCACCCCTGGCTACTTTTTT 

Sequence 633 

GCCGAGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGNGCTAATAT 

NCAANGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAATGGCAGCAGTGACAGGA 

TCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGC 

CAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGG 

TGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGA 

GTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCC 

CATAAACACAGCCCTTTCCTGGCTCACACGGGCATGACCTAATTAAGAACCCCCGCGTAC 

CTGCCCGGGCGGC 

Sequence 634 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGTGA 

AATCCC TAAG TCAAACTGTGGCTTATAAGCAGAAATCCTGGTTAGTATTTCAAAGTTCTC 

TTAGCGTTTTCTCCTGCGACTTAAAAGACTTAAAAACGTGAAMGACATGGACGTAAGAC 
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TCCAAACAAMATACATTTCTTTGAAACTAAATAGCTCTTAAGTMGAAAAATTTCTATA 

GATCTTCAAATCATCCCCTAAGCAAAATATTCTCTMTTAAGTATTTCTGTATTTCCATC 

TATGTTCTTCCCAGGCTTGGGGCTGTTGATCAGACCTATTTTTAGGGGTAAGTTTCTAGG 

GGTCATAGAAGATACAGATTTTGACCTGCTTAATGTCMGAGGTTGCACGGTTGATTTGT 

CCAGTTGTGMTTCTATGAATGAAGCTTTTTGCTTAMTAAAACGATATTCCCCTCTGGC 

TGCTGTGAGCACCGGGAGACTTGTTTCGGCAGTGCCTGGGTGCTGGGGCAGGGCCCG 

Sequence 635 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGAGTAAAATAAAAGGGTCG 

GATTTTGTAGGATTCTAAGGAAGAGGCAGTGTGCTCTGTCCACAGGCTGCAAGGTGAGAA 

CCTAMAGAATGAGATATATTCCCATTTTGGAATGGCAATCAAAAAGAGGATCTCTCTGT 

CAAGTCTTTACATTAATGCTGAGTAACAATCTCAAAAGCCTGCCCATTCCCCTTTAGACA 

CATGTGGCAAAAGCAGAACTGAAGGAATGGCCAAGGGGCTTGAACAAGTAGAGAGACCGA 

CAGTCTTTCAMTTTCAGGGAACCCCAGATACATTTTGGGGGAGCCACTGTTTTCCCATT 

TTCCTGAAAAGTTCTTGCAGGTATAAGAAATAGGAATAGAAATTGAATAGGTTCTGGAGC 

CAGGGCTACAAAGGCCCCAGCTCTGATCTGTTAGACTGAAAACCACACATCAGATGAAAT 

TATATNCACAAAAAGGAGAGTCCTTAAAAACAGCCATTTCGGTCCCT 

Sequence 636 

AAGAGCACGTATAGCATGGGGGAAAGAACCTAAATGTCTCTCTGTCCTGTGAGCTGGTGA 

AAAACCCAGCATGAGAACGCAGTGTCAGGTGTGGGACTCCTTCTGCCCCTGCAGTGGGTG 

TTACGGGCGGTGTGCCCTGGCGAGCMGCTTTGATTCTTGGTTCTTTGAGCTCGTTTCAG 

AGGCTGAGTCCCCACATCAGCTTTAGTTCTTGGACTTCCCTGTATTAAGCAAGAATTAGG 

AGAATGGCTGTCCCTGCAGGCGCCTCCCGTAAATCCTGAGCTCTCTGGCGCAATCTGAAA 

CTTCTCTTCTGTTTTCTTTGGCTGTATCAGCCGAACCAGGAGAGGCC 

Sequence 637 

CCGGGCAGGTACCAGGAGAGATCTGAGACANGGTATGAAGTAAAAGATTTAAGATTGGAA 

GTGGAGAGTGTCATGGACCAGTGCCTTTCGGATGGGTGACTTCTGGAATTCTTGTTAGGC 

ACAGCGGAGGTTGGTCCTGTGGGAAAGGAAGAATATTTCCGGGGTGAGGAGACTTCGGGG 

TGTGGGCCGGGTGCCTTTTTAAATTTGGAATGGTGTATACAATAGGGAAAGGATGTTAAC 

TTTGCAGCAGCGGGGATGGTGAATATMCCTGATAGGGACCCTTCCATTTTGTTGGAAAG 

GGGAGGAGGGGTGTGCTACCCAGACCCAGTCTCCTGGNTGTAAGGGTAAGAAAGTGAATT 

GGGAAGAATCCTCAGG 

Sequence 638 

CCGGGCAGGTACCTGGACTCCTMGCCTCAGGGATTTACTGAMCACCATTCTATTTTAT 

MTAATCCTTAACCMGAATTTTAAGGATCTTMATTTTTCTGTGGTTCTATTGTTATCT 

GATATATAGATGATCTGCTGCCATATCCTAAAGAGCAGATGAGGCCGGGTGTAGTGGCTC 

ACGCCTGTAATCCCAGCACTTTGGGAGGCAGACGAAGGTGGATCACCTAAGGTCAGGAGT 

TTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGC 

TGGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACTAGGAAGGATGAGGCAGGAGAATCA 

CTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCTGA 

Sequence 639 

AGGTACCACTTAACAAGGGTTCTCAGCTGTGNGGNCACTGGACCACTGGGATATGCTGAG 

CTATTGCTTAAACACTGACTTAAATAAAACAAATATTTTAAATAATGAGAATGCTACTGT 

AATTAGAAGGCAATCATTTCAAAGTCTANATGGAGGCCAGGGGCGGTGGCTCATGCCTGT 

AATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACATGAGGTCAGGAGTTTGAGACC 

AGCCTGGCCAGTATGGTGAAACTCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTG 

GTGGTGTGCACCTGTAATCCACTGAGGCAGGAGAATCACTTGAACCCGGGAAGTGGAGGT 

TACAGTTGAGCTGAGATAGCACCACT 

Sequence 640 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCCTTTGGAATAATGAACTCCAGT 
TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 
AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 
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GATTCATTGACGGATTGATACAGGGAAATGAAATGA I I I I I I I I I I 1 1 1 1 GGTTGGGGGA 
GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 
AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 
ACAGGTGCACACCACCACACCCAGCT A I I I I I I A 
Sequence 641 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCANGGCCTTTGGAATAATGAACTCCAGT 

TGTTCCTCATAGGTGC AGCA GAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 

AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 

GATTCATTGACGGATTGATACAGGGAAATGAAATGAI I I I I I 1 1 I I 1 1 1 I GGTTGGGGGA 

GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 

AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 

ACAGGTGCACACCACCACACCCAGCTATTTTTTATAT 

Sequence 642 

AGGTACCTCGTTTCTGAGGATCAANACCTNAGNGACCGNGTGTGTGTGTGTGTATTTGTG 

TGTGTGTGAGTCCTATTTGGGCCCCGCCTTTCAGCCCTGTCTTGCAGC 

Sequence 643 

AGGTACTTTCAATTTCTGTGGGATAAACTCCAGCTCCAGTTTCAGAACCCACTCTAATTG 

GTTTAAGCCAGGAAAGGGAGAGGGACATGTTGCTGGGAGGCCCCCATCTGGGGCCTGAGC 

TTGGAATCAAATCAGAGGAAGGCAACACATGTAAAGTGCTGAGAGTGGAAGGATGAAGAG 

AGCTAGGGCTTTGTGCCATCACTCGTGCTCTGGACATAAGTGGAGCTGGGATTCAGCATT 

ACCTGCCCCTGTACCTGCCC 

Sequence 644 

CGGGCAGGTACTAGTCCAGGTGTGAGATGAAGGGGGCCTGGATGAAGCAGAGGGTGAGAG 

ACAAGGAAGATTCTGAGGACCTTGTGGCTAGATGTGGGGGTTAAGTCAGGTTCAACTCCT 

AGGCTGGATGAATTGGCAGATGGCACATGAACTACAAGAGAATGGAAGGCAGAACCTATT 

TTGTGGGCAAAAAATAMTTACATTTTGCAATACTGAATTGAGGGGCTTCTTGGAAGTCC 

AGGTGTAGATGTCTTACAAAAATAGAATATTCTGGGCTGGGTGCAGTGGCTCACCCCTGT 

AATCCCAGCACTTTGGGAGGCCAAGGTAGGGGGATCACCTGAGGTCAGGAGTTCGAGACC 

AGCCTG 

Sequence 645 

GGNCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCAC 

CAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGA 

CCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCT 

GGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACC 

Sequence 646 

CCGGGCAGGTACAGGGGCTTGGGGGCTTGGCCAGGCTCTTCTCCATCCATGCCACGGGGC 

TGACAGCCACAGATCTGGAAGCTCAGGCCTAGGAGTGCAGGCTCCGTTAAGCCCTGTGTC 

CAACATCCTGACTCCTAGGGGTGCCCAAGATTTGAGTGGCCACTTTCACCTCTGGAGGAA 

GTAATACCTAAGGCGCTGATAGAAATAGAACTTCCGCTGCCAGGCAAGGTGGCTCACACC 

TGTAATCCTAGCACTTTGGGCAGCCTCAACGCAGGTGGATCACTTGAGGTCAGGAGTTCG 

AGACTAGCCTGGCCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 

Sequence 647 

AGGTACNTGTTTCAGTCACTGGGCTGANNTGGNNCACAGCACAACCTTCATAGCCACTGT 

ATGAAGAAGTANAAGACCCAGACTCTTGCTTTATGTTGGTATCAAAAGTCATTTCAGAGT 

CAGGCTGATCACTCCCAAGTAACCCACTGACTTCTTTACTCCAGCTCTCTGTCTGCTGNT 

GACTCANAANGTNACACTTNATTTTCTCCATTGCTGATATAATCATATCTGCAACATAAA 

AGTGGGCATTTTCCTTTTCTACATCAACAGGCAGCACAAATACCTCTGGTGAGAAGGAAT 

TCNAAGAAATGGTTNTTCTACTGACTTGAACAGCACCTTCATCAGCAGCAGATGTCAGAT 

GGGAAGGC 
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Sequence 648 

CCGGGCAGGTACCACTAGATTGCCTCCCTGTGCCTGGGCMTTTCAGAAAATGGTGGTTT 

TCCTTTCGTTTCCATCTTTTTTAAGACTTAAAAAGTATCTGCTCTCATTTTCTCCT^ 

GCCTCCATGCCTTGACTCAAAAAATGCTGTCTTAGTTGACAGCCTTGAAATGAGTATGAC 

CCTAGCTCTAGTTGGGTGGAAATCACCTCGCATAGAAATAGACCTGGAGGGCCGGGCACG 

GTGGCTCACTCCTATAATCCCAGCACTTTGGGAGGCCCAGGTGGGTGGATCCCGAGGTCA 

GGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAA 

AAA 

Sequence 649 

AGGTACTAGTATGAAGGAAATAATATCCACACACTGATACTGGTCCAGCNGAAACCAAGA 

CCGCTCCTGGTGCATTAACTTTTAACAGAGCANGGACTCANTTCTCTGAAAATAGTGCCA 

TAAAACATGTGCTCCCAGMGAATAMTATTTGGCTTGCTAGMTTTCTGCNGCTTTTNT 

GTAAAAGTTGATTATTCGGTATTAAAGAGGAGTATCAAATATGNGTNNATGNANNAAAAA 

CTTGGAMNAGTANNGGACCNNGGCTTATCTCNTCATTTTCATTCTGCACACTNCAANTC 

ANTCNTTTTCCCATCTTNNTTCCCNTCTCTGNAATTTATCACCCTCCCCCTCT 

Sequence 650 

GGCGATGGACTCCACCGCGGTGGCGGCCGAGGTACTTACCACAGAGAAAAGCCAATAATC 

ACAATATATGTTGTCTTACTGACCCATAACCCATTTTCTGAGGGTGGCAGGCATTGTGCC 

CCCACTGTGAGGTGGGACTATATACATATACAAAGGAGGTGTTTAACTGGGTGGCATGTC 

TCAGGGAGATGTAANGACTTACCTGCATATCCTGGCAGTNTTGAAATGATAGTGAACTGT 

TCNTANGGCNTCCNCTTGATGGCATAGNCNAAACACCAGNATTTTCTTGGNAGAATGATT 

CGGNAAATGCTACATAGAAGAAATGGNTGGTGAGCTNTTACTGTGACTGTGCCCATAGTA 

AGTCATCCTGGACCCTCTGAATCTTATCCCAC 

Sequence 651 

GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTG I I I I U M H I I I I I I I II I 

TTTTTATGATATTCATCTATGMCCAATTTAATTTTAATTTAAAAATGACTTCT^ 

GCAGATGATTTGGGTTCTAGAAAGAAATTGTGCCAGGCATGGNGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAGGNGGGAGAAACCTGTNAGCTCAGGAATTGGAGACCAACC 

CTGGNMCGTAACAAGACTTNTTTTTTACAATATTAATAATAAAAAAAGGCCAGGTGCCG 

GNGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCCACCAGACAGGTGGATCATNAGGT 

CAGGAGTTCGAGACCAGCCTGGCCAACGTGGGGAAACCCTGCCTNTACTAAAAACACAAA 

AATTATCTTTGCTTTGNTGGCGGGAGGCTNTMTCCAGCTTCTAGGGAGGTTGAGGCAGG 

AAMTCNCTTGMTCTTGAAAGCAAAANTTNCAATNAGCCCNGGTCACACCATTTGCCTT 

CCAANCTGGGCAACAAGAGCCMATTTTNTTNAAAAAAAAAAAAAAAGGGCCGGCCTTGG 

GGGTNNTCCCNGGAACCCCACNCNTTNTGAGGCCCCACCGGTGGTNNTAGGGNCNGAGTT 

CAAAACNNCCTTCCACTTTTNGMCCCGTTTTTTAMTTCCAAAAATTNCTGGCTGGATA 

AATCCGNAGCCCCTTCTTGNGGGTGNGGNNGGANTNTTTCCCNCAANNGGGGGTGGGTNA 

NNNCCGAMNCCCTTATTTCCCCTNTGNAMGGGNCTTTTCNAAAAAAAAAAAANA 

Sequence 652 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGGCAGAAGGAAGGAAGGGCAAAGA 

AACA ATTCCAGGGCCCTGGTTTCTGGGATGACAGGCTTCCCAACACTCATGCCAGGACTA 

TTTTCCACCTCGGTTCACTATGGG I I I I I I I I CI I I I I I I I I AATATAATGAATTTTTAA 

MTGTGTGTTTGTTGCCCAGATTATTCCANAAAGAGTTGAAGGGAGGAAAGGNGTGCNTG 

GGGTGCNTGGGANTTTTANCCCTCTNTCCACCCNGATTTCTAAGTTGGGGGGGGCATCCA 

MCAGCTTCACCCANGTGCCCAGGCTNTTTTTTGNTNTCCAAAGCCAACCCTTCCAGGGC 

ANGGANGGGTGAAGNTTAGGAGGGCAAAGGTTAGCCTGGAGGCTGCAATTAACAAGAATC 

AAANTGGGGTTTMGGATTCTCACACCCAGTTTGCTMTTTCAGCTGGTCCTTGTAGAGG 

TGACACCTAGTAGGACMCATGGNTTTTNGGGCAGGGCTGGGGTGCTGGTCTCTGCTTTC 

TAGGGTAGAAAGGAATCATACATTGAAAATGCTTAAATCGATGGAATGATTTATGTTCCT 

NATCTTTCATCTTTTTCTGNGTGGCTGGTTTTCTGCCANCCTTACTTGGACAAGCACCAT 

TCTANACCTTTTCTNTAGGCATNTCCNAGAANGNGAAGTNGAAAGGAAGAAAAAAAACTT 
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ATTTT7TNNGNCCAAAAATTNGGCAAAAAAAAA 
Sequence 653 

TCCACCGC GGTG GCGGCCCGAGGTACGCGGGGGCCCTTCTATCTCAGGATGTTTGCACTT 

GCTATTTC CTTTT CTTAAAGGCTCATCCCTAGATATTTGCATGACTGGCTTCCTAATTTN 

NTGTAAGCTTTTGCTGAGAAGTTACTTTACCAACTGTCATTGAGGTTTTCCCTGAACATC 

TTAGGTAAGATAACAAGCTCCCCTCCTTTTCTTTCCTCACTTCTTGGTATTCCTTATCTC 

GTAACI I I I I I 1 1 INGGGGGGATNGANACTTNCGTNTTGNTTTTGTTGGCCAAGCTTGGA 

GTGCANNGGGTGCANTCTTGGCTTNACTGAAACCTCCACCTCCCGGGTTTAAAGCGATTC 

TTCTGC CT TNAGC CTNCCGAGTAGCTGGGACTACGGGCAAGTGCCACCACACCCAGCTAA 

i i 1 1 I iG TATTTTTAAGTAGAGGTGGGGTTCACTGTGTTAGGATGGTCTCTATCTCCTGA 

CCTTTTGGGCCACCCACCTCGGCCTCCAAAGTGCTGGGATTATAGGTGTGAGCCAGTGCN 

CCCGGCCTCTCATAATTTTCTTAAATT 

Sequence 654 

CNAATTGGAGCTCCCCGCGGTGGCGGTCAGTTNGTCTTAGAGATACCCATGAGGTCACCT 

ACTCAAAATGGGGCTCAGAGTAGCCTTGTCCCATTCTTGTCCAGTGGGCGCAGCTACAGT 

CTNNCTGGNNNGGAGTGACTGGAGGCTGTCCCCACGTCCCACTTCAGTGAGGCATTCATG 

TGCACCCAGCACACTTTCTAGCTTTATTTGCCTGGAGGGGAAGATTCTCCAGAACCTTGT 

TAAGATGCACAGNGNGGGCCCTTGGACTGGCAAGTGTGGCCTTNGGCAGTCCCTNGGAGC 

TTGTTAGGMTGCAAMTNTTMGCTTCTTCCTACTGNATCTAAAGGTTGANTTTAMCA 

AGATCCAGCTTGTTTCGTTTCACATGAAAGTTGAGGCACACTGCTCTAGAAAGTTCTTTT 

ATCTTTACTGGCCCACCAAAGTAATCAAACTTTGNGAAGTACCCTCGGNCCGCTCTAGAA 

CTAGTG 

Sequence 655 

GCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGATATGAAGTGAGGTTAAGTCAGA 

TGGMTGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTTGTGN 

NCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAATAGGT 

GCTCGAMAACATCTGTTGAATGAMTGAATTCTTTTGTTTGCAGTAGGGCAAAGAAGGG 

TAGAGAGGAACNACTTTGCCAAGGTGATNTGTAAATGTTGCAAAAGGGTTTNGGCCAGAA 

AATTCCNANAACCGATTNGAGAGGCAATACATGTTAAGGGACCTNTAAGATGTTTCACAA 

CCTTGGAMTMTTAMMGAACTTTCTACTGNTTTACTTATTTCCCACTCCTGGCTGCC 

CCTCTTGGGTGGACTGCCTNCTGTTGGAGGGAATACTGNGTGAGACACATCTTTTAGTAA 

AACAGAAATGTGAAACCAACTTGCAGAAATCACAAGCACACTGTTACCAAATAGGTCTTG 

ACTGG CTCCC TTNCTGGGGGACAAATGTTTTGATMTGTCTGTCAGTAGATTCAGTTCCC 

TATT TCI I I I I AAGACTTGATATTTAAGAATACTGGTTCTTTTTTGGCCAGCATCGCAAN 

GAAG I I I I I CTTTAACTTTTGGGCCAAAAAAAAAAAAAAA 

Sequence 656 

CGMTTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGATGTTTTCCTAAATTTTAAGT 

CAMTTCTTCCTTGACACATACCTATTTTTATTTTGTTTTGGTTCTCATCTCTGTGAACA 

GAGCMAGCATGCMCCATTGTAACACTTtCATTTGTTTTTATAAACTCAAGTTCTAGAG 

TTGGATTTCATGATTTGCATAACTCGGCATAGTGTAAGTGCTTGTAGTTTTAAACAGAAA 

AAGAGGGAAGAAATGACNATTCCANAAAAAAAGATCAAATCTTATGACTGTAATTTATTA 

AGGNATCCMTGGAATTCTTTCCCTTTTTCTTTC I I I I I I I I I I I I I I AAAGAGACAAGC 

TCAAGTTCCATAAGCTGGGAATGCAGTATCATGATCCATAGTTCACAGCAGCCTTCAACT 

CCCTGGGGTTCAAGGNGATCCTAAGAACTTGNGGGCCTCAAGCAGTCCTCCTGCCTCAGC 

CTGCCAAAGTGCTGGGGATTACAAAGCATGAGCCACTGCTCCTAATTCTTAAGAGATA 

Sequence 657 

GAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGAAC 

TTTGGGAAGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCAGGACCATCCTGGCTAACA 

CGGTGAAGCCCCGTCTCTACTGAAAATGGAAAAAATTGGCCGGACCGTGGTGGCGGGCGC 

CTGT GGTCCCAGGTGGCTGGATACACGGGTGTGCACCACCATACTGGCTGATTCTTGTAT 

TTTTGGTAGAGATGGGGNTTTGGCCNNGTTGGTCCAGCTGATCTTGAACTCCTGCACCTG 
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CCTNGGCCTTCCAMGTGTTGGGATTACCGGTGTGAGACACTGGCCCCTGGCTATATTTT 

ACTATTTGGAMTCACAATGCATCTTAAAATTGATGGCTTCTTGCMCCACTTTCAACCA 

GGTGCCTGTCATGATTTAAGTGCTAGCATCAAGGCAGGTTAGTTATGAAGAAATAGAGTG 

TGTGTTTATATACTCACACAGTTAGMATCGACCCTTTTAAAMTTATTTCTTrTTGAAA 

A 

Sequence 658 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCNGGTACCGCGGGATATGAAGTGAGGTTAAGT 

CAGATGGAATGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTT 

GTGTTCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAAT 

AGGTGCTCGMAMCATCTGTTGAATGAMTGMTTCTTTTGTTTGCAGTAGGGCAAAGA 

AGGGTAGAGAGMANCAACCTNGCACAAGCTGNTTGTTNAATGTTGCAAAAGGTTTAGGC 

CAAGAAAANTTCNAAAACCCATTNGAAAAGCATACATGTTTAGTGGAACCTTGAAAATGT 

TTTCACAACCTTGGGMATMTTTAAAAGTAACTTCTACTGGTTTTACTTATTTCCCACT 

CCTGG CTGCC CCTCTTGGGGTGGGACTGCCTCCTGTTGGGAGGGGAATACTGTGTGAGGA 

CACATCTTTTAGTAAAACAGAAATGTGAAACCNACTTTGCAGAAATCACAAGCACACTGT 

TNCCMTTAGCTTGACTGGCTTCCTTNCTGGGGGGACAAATGTTTNGATAATGTCTGTCA 

GTAGATTCAGTTCCCCTATTTCTT^ 

GCCACCTCGCANTGGMGTTTTNTTTACTTTTGGCCAAAAAAAAAA 
Sequence 659 

CCGCGGTGGCGGCCGAGGTACTGGTAAAGGGATAGTCACATAGATCAATGAAAAAGAACA 

GAGMTCTGTGAACAGACCATGCAAATATGCCTGCCTGGTTTTTCACAACAGTGCAAAAG 

CAACTCAGCCAACAAAAGACAGCTTTTGGCCAGGCCGCAGTGGCTCACTCCTGTAATCCC 

AGCACTTTGGGAGGCCCGAGGCGGGTGGATCAACGAGGTCAGGAGATCAAAGACCATCCT 

GGCTAATATGATAAAACCCCCGTCTCTACTAAAAAAACACACACCCCAAATTGCCCGGTG 

TGGTGGCAGGTGCCTTTNGTCCCAACTNCTTTGGGANGGTTAAGCAAGGGANAATGGCNT 

TGMCCCGGAANGGAAANCTTTGCCNTGGGGCCCANATTTNNNCCCNTTNNNCTTCANCT 

TTGGGTGNANAAAAANCNGGACTTGGTTTCAAAAAAAAAAAAA 

Sequence 660 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGAAGGTGAGCTTTGAA 
GATGCAACATGAATTTGACAGTANAGATGTAGGGAGGAAGGAAGGCAGGACAGGTCAGAC 
AGAAGTGCAGGAACAGCCCAGGCCTTTGCAGCCTTCCCACACCCCCTACAAGACCTGCC 
Sequence 661 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGGACT 

TGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTTTCTCCCCGCTTGCCGGGGTGGTCC 

TCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTGTGGAGAGGGACTGACGTGTTTGGA 

GGGAAATGGGAATGTACCT 

Sequence 662 

AGGTACTCCAAGCTCTGAGACCACCTCTTCTGCAAAGCCTTCCTGATTCTGCAAAGAACA 

GGTAGGCATTTCATCCTTGGGACCTCACAGCAATTCAGGACACATTTGTGTCCCAGCCCT 

GCTTGGCTTGGCTGTCTCCATGAATATACACTTTGTAACTTCTGCACCAGGCATCATACC 

AAGCACACAGTAGGCACTCCTGTGTTTTTTGAATAAGTGACTATATCATCACCACATTTC 

AMTGCGGAATATATGAGCTACTAGAAMGACATMGGGTAGATTTTACATCTTTATTGT 

ATCCTAGATATACAAGTCTATTACTGCCTTTTCCCATGTTCTGTCAACATAGCATAAAGA 

ATGTGGATTTACCTGTTAGAAATTGAATAAGCGGCCGCTCTAGAACTAGTG 

Sequence 663 

ACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACMGTTACACGGCAATTT 

TNTTAATAACTCCCTTTACTATGTCAGACAAGCTATGTCAAGCGTCTTCTGTATCCTNTA 

CGGGGAAMMMAMGTTMCCAGAGCCMATGCTTGCTTTCAAAGATAACTTGCCATC 

CTGAMATATAATTTTTACAATTCAATAACACCTTTTTGAAAAATAAAATAATNTGGCAA 

AAATGCCCATGCATTAACAAACCATTTTTTCAGTTTAATC^ 

ATGTATTTTAAATAAACAAAGCAAAATTCAACTAAAAATACAATCTGGATTCCATAGCCA 
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ANGGTTTTATTTACMTTCCTANTAGGAAGGCTTTATTTTTAGCTNTCAAATGGGGNNGG 

ACCTATMGGGMATTTAAACCGTTTNCNTTGAGTTTTTNTNTrTNAAGGGGAANGGGGG 

AGGANTTCCCAAAATGGGGAMGGGGAAMMAGGGGGNAANNCCNTTTTGGCCTTTTNN 

GGNANTTTTAAAAMAMNTTTNCCCCCCCGNGNCCCCCAAMAAAAAANNAANNTTTTT 

TTNAAAAAAAAAAAA 

Sequence 664 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTGCTGGTTTTCTGGT 

GTTCCTACAAGCTGCCTAGGTCTCTTTTGTTCTCAGCAGTTCCCAGGCATGCAAAAGTTG 

CCAGTTCTGTCAGCATTCCAAGTCAGGTAAGACAGAAAGCCATCTCTTAGGCAGTCCCCA 

GAAMGCTGMAGGTTGGATATACTTTCTACTCTTCTCTTTTCTTCATGAGAGAAAGGCC 

ATGTGGGCATTTTCTCCCAATAACACTGAGTTCTGTTGTCTTCTGTCGGCTGTGCTGCAG 

GTTCTCAGGTGCTGCAGTTAGCTGCT 

Sequence 665 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGTAAGATTTCCTGAAGGGATCCATAGCC 

AAAACATGTTTGAAGGCCACTGGGCTCGCTAACTTCTAAAAGCACCCCAGTTCTAGCAGA 

CATCCTAAGGAACATTCCCAGGAAAATTCCAGCCTAGAACCTCCTGGGGTCTGACAACCT 

TAGAGAACAGTGCTGGCTTTGAATGGGCTTGGGGGCAGCCTCGAAACCCTCTTTCCAGTC 

TCCATGCAGGCAGGGGAGCTCCTTAAGCAACACATAGGACATTTCTGGGAGAAATGGGAT 

CCCCAACACAATGAACACTATAGATTTTAATGGTCTATATGGTTAAATACACAAGGCCCC 

TCATTTCCAACCCCGCCTGTTCATCTGATTCATCTGTACCTCGGC 

Sequence 666 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGACATTCCCCCAGTGGCT 

GAAGTGGCATATGAATTATGAAGTTGGATCATTTGGAATGAATGTAAGAGAATTGCCAAG 

GGCTCCTCCTACTCCAGAGAGGAAACCTCATCCAGGGCCATGAAGCCACTTCCTCACCAT 

CTGTGTGCTGCTTAAGCTAATGCTGCGGGAACCATGGTTCCTTGGGAGGAATCAAGCTGA 

CTCTTGGCATGAGATTCCTGCCTTCCTAGGGTTGAGAGCGGCACTGCCATGGCTTCTCTG 

GACGACCCAGGGGAAGTGAGGGAGGGCTTCCTCTGCCCTCTGTGCCTGAAGGATCTGCAG 

TCTTTCTATCAGCTTCACTCACATTACGAGGAAGAACACTCAGGGGAAGGACCGTGATGT 

CAAAGGGCAAATTAAAAGTAAGAGGCGAGGACCTTGCCTACCCCCTGCTTGTCGTTGAGA 

GCTTTAACTCACNGGATAGTTCTTATCACTTTTGGTGGTGGCACAGGNATATGATAATTA 

GTAGTAGCCAACAGATGACTAGTNGTTGTCATGTGCCAAGCGTTTTAAAAGTTNCCTGTT 

ATTAATTTCATTA 

Sequence 667 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAGAG 

AAAGCGTGAGGGCTGGGCCTGCGGCGGGCTTTAGGGAGTGGTCCCTGGCTGTGGATAGAT 

CTGCTGATGAGTCCAGGCCCCGGTCCATTCTCCTCGCGCTGCAAGGATGCTCCTGGGATT 

TCGGAGAGGCCGCAGGAGTCATTTCAAACACATCATCCATGGCCTTTTACCTGCAGCCAG 

CGTTGCTCCGAAGGCAGCTGTGCCACGCACACCTCCTCCCCGCAGCCCCAACCCATCTCC 

AGAGAGACCAAGATCTGCTCTGGCAGCAGCCATTCTGGCGACAACATTGAC 

Sequence 668 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATAGGAGAGGGAGGAGGG 

CTGGCAGCTGAGTAGCCAGAATCAGGCAGACGGTGGTAGCAGAAGTCAGAGGCCGAGGGA 

ATCAGGGAAAGGAGTTCAACCATGGAGGACCTTGCTGGCCAGGTTAGAGACTGTGGACTT 

TTGTCTGGGTGAGACAGGAAGTCACTGGAGGGCTGTGACAGAGCTCTGAGGCTGTGAGGC 

ACTGCTCTGTCAGTGCCATGAGTGGGGAAAACAGGAGCTTGCTGCACTGGTAGAGACCAC 

AGATAATGATGACTTGGACAGAGCAGCTGGGAGAGAACTAGTTCAATAACCCTAACACGC 

CTCTCCATTCTGCATTTTCCCTAAAAATGTACCTGCCCG 

Sequence 669 • 

CCGCGGTGGCGGCCGAGGTACCTGCCCTATCTTGCTGAATGTTTTATAATCTAATAAAAC 
TCAGATAAAGACCCAGATGTCACACCTGAACAGGAAAAGCTGAAAGGAAAAGATAATTAA 
AATATAAATCAACAGAATCAAGATTTTGAAAAGACCTAGAAAACTTGAAGGATTACTGAA 
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GCCAA GCAAGAGGAAATGATAGGATTAAAAGTAAAATCTTGTGTTTAGA I I I I I I I II I I 

TTTTTCAAACGGAGTCTCGCTCTGTCACCAGGCTGGAGTGCAATGGCGCAATCTTGGCTC 

ACTGCAATCTCCATCTCCGGGCTCAAGCAATCCTCCCACCTCAGCCTCCCTAGTAGCTGG 

GACCACAGGCATGCGCCACACGCCTGGATMTTTAAAATATATACATATTTTTGTAGAGA 

CAGGGTGCTGCTTTATTGCCCAGGCTAGTCTCAAACTCCTGGCTTCAAGGCATCCTCCTG 

CCCCAGCTTTCCAAAGTGCTGGGATTACTGGTGTGAGCCACTGTGCCGGGACATAAATAG 

TTATGCTGTATTGGTTAAGGAATAATGACA 

Sequence 670 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAA 

GGG CTGTG TTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATC 

TACAT7TTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGG 

TCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATT 

CCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGC 

TGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCAT 

Sequence 671 

CCGCGGTGGCGGCCGAGGTACAAGGAAGGCCTTAAAGACTGCCCCACTCTCCTTGTTTCC 

CATCCCCTGTCCCTTCTACTTCTCACATTCACCACTATGTGCCCTAGGACAAAATCAAAT 

GTGGAAACATTTGGTCATGTCTACTTTGTCCAAGGGTGGGAGTTCTTGAGGAATTCAAGT 

GGGAACTAGAACAACTTTCTACCCTTTCCTTCCCTTCCTTCCTCCCCACCTCTACCTAGA 

AGCCCATCAATCACTTTGAACTTCTTGGAGAAAAAAGGAAACAAAAGAAAAAGAAAAGGA 

GAGGCTGGGTGCGGTGGCTCATGCCTATAATCCCAGCACATTGGGAGGCCAAGGTGGGTG 

GATCACTTGAGGTCAGGAGTCGAGACCAGCCTGGCCAAAATGGTGAAAACATGTCTCTAC 

TAAAAATACAAAAATTCGCTGGGTGTGGTGGTGGGTG 

Sequence 672 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACACTGGTGGGG 

GAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCGCAGGGCCTCTACGGACCTTACTAGA 

AAAATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAAAGAAGTGGACTGG 

AGTCAGAATACAGCTACATTTTCTCCAGCCATTTCCCCAACACATCCTGGAGAAGGCTTG 

GTTTTGAGGCTTCATGCCAGAAAGGGGAATGGGGAATGGCTGCTTAACGGCATGTNTTTT 

TT 

Sequence 673 

CGCGGTGGCGGCCGGCCGGGCAGGTACACGATGAAACGGGGGTAAGGAAGGAGAAGAAAA 

ACATTGAAAGGCATTTGACAGGGTAAGGTTGTATTCCCCAGACAACCCTGTCAAGCAGCT 

CTGAAGGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGAC 

CATGTCCTATTTTTTGGAGGCGTTGGGGGGAATTGTCTTACGCAGCACCCAAGCACACTG 

CTATGCAATGGACCACAGATAGGAAGCAAGCACTGCATTTGGCTCCCCCGCGTACCT 

Sequence 674 ® 

CCGCGGTGGCGGCCGAGGTACTAAATCATTAATTCATCCTGAGCTAGTGGCTTTATTAAT 

GAGTATCTCACAAAT ACCAC AAAAATTCAACCTGGCCATGTGGAGCAATATAAAATTATG 

GCATTTCTTGGTATGTTTTTCTCTTTGGCGAGGAGACAACTTGATCTTGTGTTTCCAGAA 

GCATGTTAATTTGCCCTGCTTGCAGAATCTCTCTGGCTTGAAAGGAGATTATATTCATGG 

CAGTCTGTGAATTTTCATTTTATTTCATTTATTTATTTGAAGACAAGAGTCTCACTCCAG 

CCTGGGTGACAAGAGCAAGACTCCCGTCTCAAAT 

Sequence 675 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGGAGGTACCATGAGGGGAGA 
ACCGGCAAGGGGTGCCATTCTAGCATCTGGGTGGGAGAGAGGAGGCTGAATGCCAGGGGA 
AACTTCTTGGAAAAAGTGATGCTGAGTTAGGACAATTTAGTCAATGAGAAGGGATCTGGC 
TGTTCCTGGCAGTGGAGACAACATNTTTAAAGGCATGGGAGAATATCTAAAATTTACCCT 



Sequence 676 

AGATAATAACATCTGATATCCACATGGGGTCTGGAGGNGCAAGCCACCTTCCTTTCATCC 
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CACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCTCTCTGTTGGAGGCTAAGGGCCAGTGT 
TGGAAGGAGCTCGGGTGGAAAGTGTGGTCTGCATGAGGGGCTCCCGTGMTAGAGGAGAG 
GGGTGGCNGGTACCTGCCCG 
Sequence 677 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACCTGCCTC 

TGCCAGATACCCCTGAGGGAAGAGGATGTTCTATAACCAGGCCGACAGGTTAGCATTTGT 

GMCACAGTTCTGACGTTGTTGGGAGGGTTTGTTTGCCAGAAACATCCCCATGCGCTACT 

CTTTCAACCAG AGGTCAAGMGTCCTTTACTTTTGTGTC I I I I I I GTTTGTTTGTTTGAG 

ACGGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGGCGCAATCTCGGCTTACCACA 

ACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATCA 

CAGGTGCCCACCACCACGCCTGCTAATTTTCATACCCGCGTACCTCGGCC 

Sequence 678 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGTGGTATTTCA 

CATAATATTAACCAGC TGTTA GCAATGACTGATATATACTTCCCATTGAAAATGATGTAA 

GGT^CTGAA AG GAT^TC ATTTT"G ACAATTTT AT ATC ACATATTT ATATTT ACCTT AG GTGGT 

TCTTTTTMTGTTTTMTTTGGGACCACACTMTTTCTMCTTGGTAACTCATCTCT^ 

CMAATTMTACCMGCCAAGAAAAATGGTTTCATGAATAGAATCTACTAGTCTTTTATA 

TCTT ATAAT GGTAGATCACTGATGAGGTAGAACTCCATAAGAGCTTCNCTCTCACAGTNA 

MGGTTTTGGTTGT GCATG GATTACACCTGGTGAAAGTTGGTTAGTATTTGTCTAAGTGG 

CTTMGACAAATTTATTTTGAmGTATTGTGMTGACTTTGCGMNCACCCAGAATm 

TNCCGCTTCGTGTGTNGTGTGTGTGTGTGTT 

Sequence 679 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCCACATAATAGCTC 

AGT GCATGC AATTTACAAAGTAATAAGTGAAATGCTCCCCATAGTTGACTATAACATTTC 

CTCATTTTTCTCTGMTTTGCTTTTTAAAAAACTCTTCCCCTTGCCATTCCCTTCCCCAT 

TCCAGATTGTAACTGCTTCTTTCCAGCTGCATCAGAAGAAGGGGACTTTCCATGTAGGTG 

TTATTCTCAGAAAAGGCCAGAAAAGACCAGGTCATGGTGGGGATGATTTGCTCCAAGCAT 

AAAAGAGAATTGTGATGGTTCAGGAAGACTGGAAAATAACGAGACTGGAAAGAAATGAGA 

AGGGCTTCAGAGGAATGGCACATTGAAATAAAAGGGAAGTGGTAAGAACAGGAACCCAAG 

NGGAATGAANGGGCNCACAGTGGCAGGGATGATTGGATAGACTGTGGAATAAAAATAATT 

TG 

Sequence 680 

AGGTACAMCTGGCTTCTTCTCTTTGTCACCAGCACCTGCTTCATAGTCTCTCTGGAGTG 

CCAGGMCGGGTCATTTAGATTAAATCTCCCATACCGTTCCTGGATAAATACCTCCTTCC 

TGCGAGCCCGCAGGGCCTCGATGACAAGGTCTCTGGCCTCCAGCTCCCCTTCCATCACGC 

TGAGGAGCATCCGCAGCTCGGATTTACTGAGAGTATCCACATCMACTCTTTTTTCAGTT 

TTACMGTGGAAATTMGCAGTCCTCCTCCCCGTTTCTCCTTCCATTGCCAGGCTCAGCT 

CCTCTCACCCCAAGTACCTGCCCG 

Sequence 681 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTGGAAAGTGAGGTGGTTT 

TATTTCTAGTTCACCATTGTCCTTAGGCTGTATAGACCTCTGGAATCCCAGCTTATGTGG 

AGAAGGTATCCTGTTAGACTTCCCTCCTTTGGTCAGCACTGGGCCTTAACTTCTGGCCCC 

TCAAAGCTGCTAAAACTGAAGGCCAGGCTTGCCTGGCTTGGCAAAGGACGTCGGGCAGAA 

GCAGCTTCTCCTCTCCTCTTGTTTCTCTGTTTCCCCTCACCATAGGCTTTGGCCTGGGAG 

TTTTCCTACA 

Sequence 682 

CCGCCCGGGCAGGTACCTTCTTGGTTGCTGTGACTGTCTGCTAGCACTAAGACTGTCTTA 

AGCAGATAGAGGGCAATGGTCTTTGAAGGCAAATGACAAAGCGTGGCCCTGAGCTCCCTG 

ACTGAGTTCATTTGGACTCTCAAGGGATGCCCTGGAGCTAGACTCGATCTGAGTGGTTGG 

ACTAACTCCTCTTTGTTTTTGTATTGMGAGCCAGCTTACCCCGCCATTTNTAAACCTCA 

GGCCAGGAAAACCAAAAAACAAAAAAACCCAACCCAAACAAAAAACAAACCCCACCTTCT 
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TNAGAANTNAGTAANCTTMGGCTTNAAGAATCAACAGNGCCCCTTTGGGNATTAAGGGC 
CATT 

Sequence 683 

CCG CGGTGG CGGCCCGAGGTACTATAAAATACTATCCTAACTTTTTATGTG 1 I I I 1 I IAA 
CTTGTTTTTTAGAAGTTTTGTAGCGGTTTTTAAAMTC 

TGCTAATATCTGTATCTTTTACTCTATMCCTAATTTTTACATTTTCAGAAAMMTTTT 

TACAACAATGTAAAAAATACATGGCCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCAC 

TTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTAAGGAGTTAGAGACCAGCCTGGCCAA 

CATGGTGAAACCCCGTCTCTACTAAAAGTATAAAAATTAGCTGGGCATGGTGGCAGGCGC 

CTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAA 

GGTTGCAGTGAGCCCAAGATCCGCGCCATTGCACTTCTAGCCAGGGAGAGAAGAGCCGAG 

ACTTCATCTT A AAAAAA AA AAA AAA A AAAAAAG GTC 

Sequence 684 

CGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTCTGGCCCTGGCTTTATTT 

TTAGATTTTCCTTTCCCGGTTGATATCGGAAGGCACAGAGGCAGGAGGTGGGGTGGATAG 

TAATGTGTGCCCCCTTGGGGGTNANAGTGAGGTGGAGGGGGATGTTAATNACCATGAGAG 

GCAGAGGGTCAGNCNANTTTCCANNGCTTCNNGCTTCCTTTAAATGANGGAAAACACGTG 

CANGTNTTAGGAGACAAAGGAAGGGAANTGACTGTTTCCTGGCCTGGTNTGTGGGCCCAG 

TNGNCTGNTNCNTTCAGTGNCTNCGTGCANTTNGACTNTACACNTANGNNGGCAGGCATA 

GGTGTNCGGTTNTGAAAGACNGNNNTCTTTNCACATTCTCTNCTGCTCTAGGGACTGAC 

Sequence 685 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGTAGAACTCTA 

GGAACTGACCAAACCCTTTTAACAACCAGGGGAGTGCTTGATGAAGGAAGAGGCTACCGA 

TCTTTCATAAGTATGAATAAGCAGCATGCATAAACCAATTACCTTCCCCTATTCCTCACA 

ACCACCACCACCACACCCACCACCTCTTGTGGCAGTGGGGATAGCAGCCCATGTTCCTGG 

AGTTGCTAACCGGTGCCAGGAGGGACAGTAGGGATCATGTCCTTCAAAATTTAGGGTTGT 

ACCT 

Sequence 686 

CCGCGGTGGCATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGG 

AAATGAGTGTGGAAAGAAGAATGCANAAATCTGAGGACTAGAGCCTGGAGATGGGGAGCT 

TCGAGCTCAGAGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCTTCTGAGTATCT 

GGGACTGCAGGTTATGTGCCACCACACTCGGCTAATTAAAAMAATTTTCTTAGAGACAG 

GGTCTCTCTACGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGTGATCCTCCTGCCT 

CAGCCTTCCAATGCCTTGGGCTT 

Sequence 687 

ATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGGAAATGAGTGT 

GGAAAGAAGAATGCAAAANTCTGAGGACTAGAGCCTGGAGATGGGGAGCTTCGAGCTCAG 

AGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCCTTCTGAGTATCTGGGACTGCA 

GGTTATGTGCCACCACACTCGGCTMTNAAAAAAMTTTTCTTAGAGACCAGGGTCTCTC 

TACCGTTGCCCAGGCTGGTCTCAAACTCCCTGGGGCTTCAAAGTGAATCCCTCCCTNGCC 

CTCAGCCCTTCCAAATGCCCTTGGGGGCTTACAGGCCATTGGAGCCCCACCATGTGCAAA 

NGAAAAGAANAGCMTTTTTGGACATCCTGCCCAAAACAAACAAAGGTTTGGGCAATGGG 

TCCTGGTCAAGCAAAAACAAGTGGGGTTTGGNGGAATAAACCAACCCTTGGGTTAAAAAT 

AA 

Sequence 688 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACT 

TGAAAGGTCACGTAGCTGANACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAA 

AGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCT 

TGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGA 

GCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGG 

Sequence 689 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATACTCATTAGAGTTGCTCGGTGG 
AGATGGAATGATGGTGGGGTGCAGTTAAACATGGCTGAGTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCAGGTCATTCATATTTGGGGGAAGGAACAAAGAGGGTACCT 
Sequence 690 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCTATGAATCTCACA 

GATGTAATAATGAGTTAAAGAAGCTAGGCACAAAAGAATATTACTGTATGATTCCAATCA 

TATAAAGTTCAAACCAGATCAAACTAATCAATGAACGAGGAGTCAGGATTCTGGTTATAT 

TCAGGGATAGTGATGGAAGAGGGCTATAAGGAGGGTGTCTGGGTGCAGGTCATGTTCTAG 

ATCTTGATCTGAGTGGGGGTTACATAGGTGTATTCACTTCATGAGAATTCAGAGGGCTGC 

ACACTAATGATCTGT ATAAT GCTCCTCTATAGTATGTCACACTTCAAAAAAGTTTACAGA 

AACAGTTCCTTCCTAATTTTCACAGGGCCTAAGAGCTAAAAACGCAGCCCCAG 

Sequence 691 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCGCGTAGCTGAGACTTCAGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAAGTGAGAAATTCCTTGAAATTTGCAAGGT 

Sequence 692 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGCAC I I I I I I 1 1 1 1 I 1 1 I I I I I I 

TTTTTTTNCCATANAGATGGGGCTTTGCCATGTTGCCCAGGCTGGTCTCAAACTTNTGAG 

CTCGAGCAATCTGCCCACCTCGGCCTCCCAAGGNGCTGGGATTACAAGCATGACCTGCCG 

NGCTGGCTAAAGTTTCTTATTTATACTTACTCATTCCTCTAATATCTGGATTTCCTTAGT 

CATCTGTCACTTCTCCCTGCATATTTCCTGTGATGTCTTTAGGTCCCTCCCACTNTTGTA 

GTAGCACTCCCTGGGGACCMTTTCGMGGATGCTGAGTCATATGGTTTTTGGTTTTGAG 

AGGGTTGAAAATGGAGACTCAACTCAATTTAGGAGCTATCCCATCATAACTAGTAGCAAA 

ACACGTCACTACTTGAGTCTCAACAAAAGACAAAAAAGGTTTNAAAGTTGGGGAACAAAT 

AGCTGCCAAGG6TTNTTNTTNTNTGACAAAAACATTGNGTTGGGGATTTAAAATCNATGT 

GAATCCTTAATCCCTAACTCATCCATGTTGGGG I I I I I I 

Sequence 693 

CCNCGAATTGGAGCTCCCCGCGGTGGCCGCCCGGGCANGGTACCTGCTTNCCAGAAGTGT 

TATCATGATTAAATGACAGACCAGTGGCAGTAGCATCTCCTGAGGGAGGGTTAGAAATGC 

ATA TTCTC AGGCACCACTGCAGTCTTGCTGAATCTGAAGCTTTGGGGATGGGACCCGGTA 

GTCTTTTTGGATAACTCTGCCAAGNGGTTCCAATGTGCTCAAGTTTGAGAGTTGCTGAAT 

TAAAGCGCTGGGTCTTGCCAGGCATACCTGTAATTCCAGCTCTTTGGGAGGCTGAGGTGG 

AAGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGTAACATAGCAAGATCCTATCT 

CTACCAAAAAAAAAAAAAAAAGTACCT 

Sequence 694 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTCTCTGTCTTTAC 

AGCTGAAGCATCAGAGGATGGAGTGACCAGGCTGGTTCCAATGACAGTTATACGGCCATG 

GGGAGTANACATGGAGTCTAATTCAGTGCTTGAGGCTAAGAATGAAGTTGTATGCATTGT 

GGAMTTGTTCCAGGAGATCTTGCAACTTTCAAGTTTGAAGTCATGTCTGTGACAGTCCA 

GGAATNTGATGCAGCTGTGGAAGACCAGGTGGAAGGGTGTTCTGTAGAAGTTGTGCGCCT 

CTCTGTGGCCGGGGTGCTGTCCATGGTACCT 

Sequence 695 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTATCANG 
GAAAAGAAAACGTTATTTGTCCCACAGATGCTGCTAGGAGCAGCTACCCCAAGACAGGCC 
TTGCACCTTGGGTCATGACAATGCGTGGCTACTGAGAGCTGTTGACAGAGTGGACAGGGC 
CCAGACCAGGACAGTCTCTCTAGAGGTCTTCACCTCCTCAACCGTAACTTAATCAGCCCC 
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ATGCCGGGCTAGCCCCATGCCACAAAGGCTCAGAAATGCCCTGCAACATGTGGGACACCT 

GGTAGTATCTACATAGGGGCCAGCATCCATCCCAGCTGCTGGGGGTGGCTCAAGAGCTGT 

GAGGGACACCCTTTCCTGCCTGATACCGTGGACCAGTTTGCAAAGAGCTGACTGTCCTGC 

TAGGCCCA 

Sequence 696 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACI I I I 11 1 1 I I I I I I 

TTTTTGTATATTTAGTAGAGATGGGGTTTTACCATGTTGGTCAGTCTGGTCTCGAACGGN 

TGACCTCAAGTGATCCGCCCACCTTGGCCTCCCAAAGNGCTAGGATTACAGGCATCAGCC 

ACTGTGCCCAGCCAGCCCTATGCTTTTAAGAGTTCGATGGTTGAAAGAGACTGAGCGGGG 

MGGTAGAGCGGGGCAGGGGAGGGACTACTTGGAGTCAAGTCAAAGTTTTAGGGAAAGAC 

CTGAATCTGAAAAAGATTATTTAACCTTTATGTGTCTGAAATACTATATTGTGCGAATTG 

TACCT 

Sequence 697 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACACGACAGAAGCCCACGGAGCAAGCCC 

TGTGCTGGCCCCTTCACATGACTTTAGGCCCTCTAGCMGGTGATGTTTATTTACAGGGT 

TGCATAMCAAGGCCTCACCATTCAAAAAACCTTGTATTCTATTACATGTTTCACATTAA 

CAMGACTGGAAATTCTNTAGGAAAGGGATCTTTTTTATCTACATGAAAAGCACAGGCTA 

GTAAAGACTTGTTGAAAAAGTTGAAAGAACATAAATGTATATGGTATATGCCACATAGCA 

TAATGGAGGAAGATAGCAAATAGGAAACATATTGGTGAGGAAGACTGGAGTTTGATGATC 

TAGTCAGGAAMCATCAAGTTAAATCCTTTACTTTACACCTAAACCATAAACTGGTGAAT 

AAAACAAGTATGTGAAAGCACAANANGAGAGAGGACAGGCCGGGCGCAGTGGCTCACGCC 

TGTAATCCTACACTTTGG G AG 

Sequence 698 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCTATGCAGNCATNC 

TGTNAGNACCCATTCCATTNTNCTATCCCTGGNTNGCTGGTGTCAATACTNTNAAGCGAN 

TTACTGCNGGNGCTCTNNTTTTTCCCTCANAGATACCNNGTTGATTTCTTTGATTCTCTC 

CATCTCTACAGGCATAATAACTCCTAATATTTAAAAACNCTGTAGAGGGATGNANNGAAG 

CTGNGGNGAGAGCCCNTGGGCTTTTCNCNTGGGTNAAGATGCACATTCCTGAAAATTNTG 

GGCCTTGGCTTAAGCTGNACTAGNGCCGGCCACTCAGCTGATCTCACTAGCGTCACCTGT 

CGCAATGGTGCTGAAGCGCACTNCCNAGAGGCCATAAGGCAAAGCGAGAGTNCNTGGCTA 

TNGACTGGANCCCATTTAAGCAAAAAAACATGCCTCNCGNANGACAAATTCNATCAACAA 

AGGGNGGGCAATACAGGATCTGTACCTGCCCGGGCGGNNCGGGCANGAACCI I 1 1 1 I 1 M 

TTTT 

Sequence 699 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGCCCGCCACCCGGCTT 

GTGTGTCATCCTGGGCCCAGGCAGGTGATGATGCCAAACACAAGGCCCAGCTATGTAACC 

MGTAAAAACTTTCATCAGAATGCCCATCTTTGTGACCCACAGCCCATTGTCAAGAGCCT 

TCCCTGTGCCAGGAGTTCAGCAGGTTCACCTCCGCCTCCACTAGTCACTAAGACACGGAT 

ATTTTAAGAATTAAAAGCCTCCACAAGCCAGGCACAATGGCTTACACCTATAATCCCACA 

ACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCAACGAGTTCGAGACCAGCCTGGGC 

AACATAGCGAGACCTTGTCTCTACAAAAAAATTTAAAAGTTAGCCAAGCATGGTGGGGCA 

TGTCTATAGTCCTAACTACTTGGG 

Sequence 700 

GCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTAGAGGCCAGGTTTCACC 

ACGCTGGCCAGGCTGGTCCTGATCCCCCGGCCCCAGGTGATCCGTCCACCTCAGCCTCCC 

AAAGT GCTGG GATTACAGGCGTGAGCCACCGCACCCGGCTCI I II I 1 1 I I I I 1 II IAAAA 

TCATGAT TTTAA CAGMGCCTCCATTCAAGGCGAGACATGCCTTTTATTTCTCTAATTGC 

GAGACACTTTTCTGAATCCTCTTGTCAGTTGCACCTTTTAATACAATTGAGGTGACACTG 

TTCTTCATGGTGACACTGGTCTTTCCCAAGAGGTTTCAGCTMTTCAGTCTATCAGATTT 

TACATCAGATTTTAAAATTTGCTTCAAACTTGGGTGCTTGTATTCAAATTCATGCTTCAT 

AGAAAAATGCATATCAAGTTCAACAGTTGACTAACTGCAGCCACGTTCACAGTACCTGCC 
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CGGCGGCCGCTCTAGAACTA 
Sequence 701 

ATAGGG CGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAAGGCTGATACACAC 

TGACAGATTTTGTAACMGGGACATTTAAMCTGAGCTGGTMTAGACTTGATTTCTGGT 

GTTGCCACTCAATAGGCATGACTAMTAGTGTACCTCACTGTTCTACTTTTTATAATTAA 

MTTTTAGAGGAAGCT GAGTTCTTGTATTTAACTACAAGTTAGAGACTCAGCCCACAAGC 

I I I I I I I I I I I I ( I I I I AATATGGTTTC I I I I I I I I I I I I I GAGACGGAGCCTTGCTNTG 

TCACCCAGGCTGGAGTGTAGTGGCGCGTCTNTGCTCACTGCAATCTCTGCCTTCCCGGTC 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACCGGCGTGCACCACCACG 

CCCAACTAATTTTAGTATTTTTAGTAGAGACGGGGTTTCCCCATGTTGGTCAGGCTGGTC 

TTGAACTCCTGACCTTGTGAACTGCCCACCTTGGCCTCCCAAAAACGCTGGGGTTACAGG 

CGTGAGCAACCATGCCCAGCC I I I I I I I U H I I I IATTT 

Sequence 702 

AGGTACGCGGGATATATNTAAATTTAAGAAANCATCCCCGGTAATATGGCTCTTCATAAT 

TCTAAGACTAAGGCTGGNGTAGAAACCTAACCACCTACCTTACAAGNGAAGGGGGCTATA 

CCATGGGGTAAGCCMGTTTGAAATTTATGGGGAATCNTACCCAACTTGGNTTAAGGGGG 

CCCTNGGATTTGGCCTNGGGGGCCAAGNNTTTTCTGTA I I I I I I ATAAAAGGTGATCTTN 

CATNGGTATTCCCTTGGTTTACCTTGGATAAGGGGGGATTACCAATGCCTTCTTAAGGAA 

AAAAATTCACCTTATTTGGGCCTTGGGGGGAAGGTAGGGTNGGGCTTCAATAGCCCTTGG 

TMATTCTCCCAAGCCACTTTNGGGGAAGGAAGGGCCTGGANNGTTTTGCCGCCCCACTT 

ACCACTTCCCAAGCCCTTGGGGGGTGGAACCAAGAAGTGGGAAGGAACCTCTTGGCCCCT 

CAATATNNAAAAAAAAATNAGAAAGGNAANATTNCACCTATTCTTACCANAACCCCTAAG 

NACCTAATTTTTAAAAAAATACCAAAAAGAATTGGCCCTNGTTTNTCAAAAACCACCTAA 

TTTGGGAMTMMNMNGGGGTGGAAGAATTATTTTCTTTAACCCNNATNGGAATAAAA 

ATNNATNNNATTNGGGGNTCCCTTGGCCCCNGGGCCGGGCCCGCTTCTAAAAACNTAAGN 

GGGGATCC 

Sequence 703 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGGGGCAGGTACTACTGACTTACCTGCC 

ATGGGCTTTCTCAAGACAAGTCCTGCAGGAGAGGCACACTCACTTCTAGCGTCAACTATT 

GAACCAGCCACTGCCTTCACTCCTCATCTCTCAGCAGCAGTGGTCACTGGATCCAGTGCT 

ACATCAGAAGCCAGTCTTCTCACTACGAGTGAAAGCAAAGCCATTCATTCTTNACCACAG 

ACCCCAACTACACCCACCTCTGGAGCAAACTGGGAAACTTCAGCTACTCCTGAGAGCCTT 

TTGGTAGTCACTGAGACTTCAGACACMCACTTACCTNAAAGATTTTGGTCACAGATACC 

ATCTTGTTTTCAACTGTGTCCACNCCACCTTCTAAATTTCCAAGTACCT 

Sequence 704 

CCGCGGTGGCGGCCGAGGTACTGTGAAAGAACTAGCACTTTGAGCAGAGMCAAATGCCT 

TACTTGAGTTTCCCCTGGACTCTATCCCTATTCAAAGATGCTTGG7TATACCTCAAGAGG 

GAAGCAATCCAGACCAACTCCTATGACATGACAGGCACTCAAGCCTGATGAGGCAGAAAC 

CTGGCAGCTGTAGATGTTGGAMGGATAATTTATGTGTTCAAGTGACTAGGATTCTAAGG 

GCTAGATGCTAGCTTCAAGCACGGCTGGATCTAGGMGCCCTTTTGCTCTCCCTTTTCCT 

TGGTCTACTTTTCTCTGTAGGCMGTTCATTCATTCTTAGGCAAGTTCTCTGCATGTGGC 

AGCAATGATGGACACTGGAATCTCTGGGTATTCAGAGTTCTTTCAGTAGCAG 

Sequence 705 

CGGGCGGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTA 

GGTGCTCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGC 

CCGTTCTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGA 

GGGGCTGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGA 

ACCCCAAANGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCA 

TTTGTGGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCGTACCT 

Sequence 706 

NGGTTAANTGCCGCCNCTTGGCCGTAATCATTGGGNCATTAAGCCTGGTTTTCCCTNGTG 
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TGAAMMTTTGTTATCCCGCTCCACAATTTCNCACACCAAACATTACCGAAACCCGGGA 

AGNCAATAAAAAANTGGTAAAAAGCCCTTGGGGGGGTGGCCCTTAAATGAAGGTGGAAGC 

CTMACCTCAACMTTTAMTNTTGGCGGTTGCGGCTCMCTTGGCCCCCGCCTTTTTNC 

CAAGANGCGGGGGAAAAA 

Sequence 707 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAACATCCATGGCAGACGCTATT 

CTTTCCCTCTTAGAGATGCAGACACTGAGGCTCAGAGAAGTTGTCCCGCACCCAGTATGT 

GATGGAGAGGTAGAGGGTAAAMCATCAACTGAAGGATTTAGCATTTGGGGAAGAAGGAA 

GAAGCCCAAAATGGAGTAGATCAAAGGCTCCCCCGTGAACAAATTTAAAATTAAGGAGAA 

AGAAGCAGAATTCAGTCTTCTCCACACCCATAACCAAACAGCTCCTATGAAGGCACCAAG 

CCTGACGCTCATCCCAATAAAAAGGAACGATCTGGAGAGAGGGGCAGCCGCTGGTGACAA 

GAGAACCCCCCAGGCAGCCTCGTCATCTGGCCAG 

Sequence 708 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTGGGGGACAGATATCATAT 

AGGGGATTCATGTCAGTGACCAAACGAAGTGACCATTACAGCCCTTTNGAAACCTGAGGT 

GTMTTTTTAAAMTGMCTCATGACTTTAATAGTCATAGACTCAAACCTGAGTTGATTA 

TTATGAATTAGTTTATGGGAGTCTCAATATGTGMTATGATGGAGACAAGTTTTGGAATA 

CAGATAAATCMGTCACTGTATTCACTCTCTCTCTCTCTCTTTGAATAGCCTTATCTTTG 

CCTATACACACAAACAGTGCAGCCATCAMATTTTCAATTTACAAAATGTTCACAGTCAT 

GCTTCTTCCTTGACTAAACACTGGGGTTGCTGCCAGTGGTAATTGGCTTGAAACCAGCTA 

ATTTTTATATATCTATTTAGTCTGGATATTCTAGATGAGTGGGCACTATAGT 

Sequence 709 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCCCACTCCTT 

GACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCACTACTTGGG 

AGGGCTAAAGGGAMTTTGGAMTAAMTTCCAAAGTTTGGAGTAAAAAAATTCAAGTGT 

TGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGMCATGTGTTTAT 

CTGTGGGAGATMACMGATGGAGTCCCAMGACTTTAACMMTAT1TTTTTAAAAATC 

CACTAGAATAGAAMTACATTATTTAGATATACTTTATGCTGAGAGTGAGTATATATGCT 

TGTCCTATTTAAACTTGTGAGAAAAAGTGGTATCCCTTGATACATTTAGAAATATGGGGG 

CTATCTTGGTTCAJTGNGGGGGGTGGGGGCAGAAGGAGAATAAATGCAGGATGCCCTTGT 

TGAAAGGAATCTTAGCATGGCCCACAGGGGACGTTTCCAGTCGATTACCAAGGAATGCCA 

GCCT 

Sequence 71.0 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCC 

ACACTCCTTGACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCA 

CTACTTGGGAGGGCTAMGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAA 

TTCMGTGTTGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGAACA 

TGTGTTTATCTGTGGGAGATMACMGATGGAGTCCCMAGACTTTMCAAMTATTTTT 

TTAAMATCCACTAGMTAGAMATACATTATTTAGATATACTTTATGCTGAGAGTGAGT 

ATATATGCTTGTCCTATTTAAACTTGTGAGAAMAGTGGTATCCCTTGATACATTTAGAA 

ATATGGGGGCTATCTTGTTTCATTGTGGGGGTGGGGCAGAAGG 

Sequence 71 1 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACGGTGAGCCGGAG 

GAGTCATGTCAGAGGGGCGAGCAGGAGCGATTCCGTCGCCAAACAGGTTATGAGTGCCAG 

TGAGCCGCCTTAGATAGMGCATCGTCAGCACTTTATTAATGATGGATAGNGAGAATAAA 

CCCGAAAATGACGAGGATGAAAAGATAAACAAAGAAGCACAAGACTTGACAAAGCTTTCA 

TCCCATAATGAAGACGGTGGGCCTGTATCTGATGTGATAGCAAGTTTCCCTGAGAATTCT 

ATGGGCMAAGAGGTTTTTCAGAATCATCGAACTCTGATAGTGTTGTTATAGGAGAAGA 

Sequence 712 

NCCCGCNGTGGCGGCCGAGGTACTCTTATGAGAGGAACATTAAAATTTGCAATTATAATG 
CAAAGAAACAGGAGACGATCGTGAGAATAAGCAATGTCACACACATTTCTCTCCAAACTA 
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TATGTATCTTGTCCTTTAAATTCTTTGACGTGTGTGTGGGGCCTGGGTGGGGGGTGTGGG 
TGTGTGGTGTTCTMGGGCCTTTTCTACTAATTCCAATAACTGGGTTATCTGTGAGGCTG 
CTTA7TCTCACTGTATTTTACTGCTTCTTTGCCTTTCTTG I 1 1 1 1 1 I I I 1 1 AACCATACT 
CAGGTATGGTTAAAATGTAAATGAAAGAA 
Sequence 713 

CCGCGGTGGCGGCCGCCCGAGCAGGTACTGGGGCTGCACAGGCTGTGGCAACACTGGCTA 

GTCAMGCCTGGAMGACCTGTGAGCTTGMCTCTTCTGGGTTGCCATTGTTCTGTTGGT 

TTCAGCAGTGAGACTGAGAGAGCCTGTTCTGTTTAGAAMGCCACAGTGGTTTTCTAGGT 

MAGTCTGCAGGAGATGTCACTTGGTGCCTTTTCMTACGAGTTTTCCACCTGCATTTT^ 

GGAACCATTATGGGCCTTTTAAAAATTTATTAATAAGTCTCTTAAATA1TTTATAATCTA 

GCTTCTGAGACAAGATGATTTTAAAACAGTTATATGCTCTAMTTAAAAATTTA 

Sequence 714 

AGGTACCGTGTGAGCAGGTGGCGTTCACCAGGGGTGAGACTTTATTGACAGTAAGTTGCC 

T CTGCCA AAAAACGCCCTCATATGTCTGCTGATGTTTGAATTNCNNCCNNATGGCAGGAG 

GTTTTTTGGTCCTCCCCAGNTTTAAAMAAMTTGGTTAAAAATAACCTGGGTTTGGTTN 

TTTCTTTTGNTATGGGGAAGGCCCTTCCAAGNAAGNGGAAAATAAANAAAAAAGTAATTA 

NANC CTTAA CCTT CCTTAAGG GGGAATTAATTGGTAATTTCAAA A 1 I I I I I GGAATGGCC 

TTANCTTTTNTTAA I I I I I I I I AMTTTTTTAAATTTTGGGANGGAAACCAAGGGGGGGG 

TTTCCTTNTGGGCCTTTCTTGGTTTGGCCCCCAGGGGGCNTGGGGNANGGTTGGCCAAGG 

NCCAATr4mNNGCCAAMTTTCCAMCCMGGGCCCTTTCNMCCTTTGGGCCAAGNTC 

CCCCTTTCAAAAACCCCCTTTCNNCCTTGNTTTGGGCCCCCCAAAA 

Sequence 715 

ccgcggtggcggccgaggtaccgttttatgatgataacataactttaatgctccaacctg 

agaaagataaaatagactaagatgaccattgaatgcaaacagaaagttctaaatgaacaa 

tcaag n cagg acctg g aaatttcaggtccctg gtggttgg aaa antamttaaattaaaa 

accaantttcttggtttttccaaggaaamntggntaaamaaattaaggtntttaaaat 

mnccccagggaaaaatttccaaattcamattattaaaaaggccttaaattaaattaat 

attttttggcatttcnmggcccc^ 

gggtaattaaaccagggccaaatttnattaaaaaaattcc 

Sequence 716 

AGGTACACGATTATTTCACCATCCAGGTATTAAGCCTAGCACCCAAGAGI I I I I I I i i IG 

CTTCTCTCCTTCCTCCCTCCACCCTCAAGTAAATCCCAGTGTCTGTTGCCNCCCNTNCTT 

CGGTANNAACCAAGGTGG 1 1 1 1 I 1 I I I AAATTTTTCCACCAAAAAAAAATCTTCNCATNC 

MCCTTCCTTCTT1TCAATTTTTTGGCCTTNNCCTTAAAACCGNAACCTGGAAAAACCCT 

TN CTGG GTTNTGGCTTCCCCC AGTAAAAG N ACC AAAG G AAANTTTTAAANTTTCAATTTT 

TTTMTTCCCCGGGTTTTTTAAAMAGGGTTNAMAGGGAMGGNTTTCCTTAACCCCCA 

AGTGCCCAAGGGGAAAGNTTGGGGTTGGGGGGCCTTCCAACCISIAATTNCCTTAANTTAAA 

MTTTTTGCCCCNMGGGCCAAACCCTTTNNTTTGGGAAGNGAAAGGGGGGCCCCCCGGA 

MGGGGGCCCCGNAAGGGTTTGGGGGAMNTTCCAAACCNMTTTGNTMAGGGGGTTTT 

TGGGGGGAAAAAG 

Sequence 717 

CCGCGGTGGCGGCCGAGGTACTACAATAAGGACAAATATTCAAAACATTCTGTTAAGTAA 

AATAAGACAGTCAAAAAGGAAAGCTGTATAATTACACTCATGTAAAAATATTTAGTCCAA 

CNCTCACAGGANAACCAAAGGTGGTCAATAGGTTCCTCAAGCCAGGTGGCCACCCCAAAG , 

GAATGGTTMACCAAGGTTCTTCCTTCNGTTAAGGTTCCTGGAAGGAATTAAAACCAATT 

CCCCAAGGAGGTTTNCTTTTTGGTTTTCCTMCCCTTCTTAAAGGGGAGGMTTTTAM 

GGGAGGTGGTTAAMANCMCCAAAAMGGGTTTGNAMGGGNTTTTGGGGGAAGGNTTT 

GGGAAAAAAANGNTTTTTAAAGGNA 

Sequence 718 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTT 
GGATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGTGATGA 
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CTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATT 

GAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGGTGTGGTGTAAACACC 

TACATTTTAATGTAMCATTTCAGTGTAATCMTGAGMCTATCATTCTGCTTTAATCAC 

CATGAGTTCTGAAATMCAMGGATTTGTCTGACATTCATTCTMGAAATTCATTCTTAC 

CTGACTMGAMCTTTTTTAACCCGGCACAATAATAAAGAAATGACCTGTNAGTACCTGC 

CCGGGCGGCCGCTCTAGAACTAG 

Sequence 719 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTTGG 

ATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGNGATGACT 

CAGACCGACNCTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATTGA 

AATATCAGCCTTTCCCANACTGACCTGATGGGNTGACTTCAGGTGTGGNGTAAACACCTA 

CAT1TTMTGTAAACATTTCAGNGNMTCMTGAGMCTATCATTCTGCTTTAATCACCA 

TGAGTTCTGAMTAACAAANGATTTGTCTGACATTCATTCTAAGAAATTCATTCTTACCT 

GACTAAGAAACTTTTTTAACCCGGNACAATAATAANAAATGACCTGTAAGTACCTGCCCG 

Sequence 720 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGA 

Sequence 721 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GNTGCTTNGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAAAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAA 

CATGTCCTTGCTCTCCAATGGCCCTGGCAGGCCAACCTTTAGTTTCAGGGCTACCACCTG 

TGCGGGGGCTTNTGTCATTACCCCCTGTGGATATTAATGCTGCACACTGNGGTTATGACT 

TGTACCTTCGGCCGTT 

Sequence 722 

GGAGAGGAAATGTGTAGGGGTGAGGGATGATACAAGAAAGCCAAATCCTCATCTTCTATA 

GTAGAGAGTCAGCGGATAAAACCTAAAAACAATACATCAAGAAATACTTACACTTATGGA 

AGGAAATACCAGAAGTTAAAAGGGGTTACTTCTGGGACATCAGACACCAGACTGCAGGGA 

AGGGCTGCCTCTTGTATTMCMGCTTCCAGTATMTTTGCTTTTTAAAAATAGGTCCAT 

GCATTATTTTAATAAAAATTANGCTGGGCGTGGTGGCTCAGGCCTGTAATCCCANCACTT 

TGGGAG 

Sequence 723 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAGCAAGCTGTGTGACCTA 
GAGCACAGTGCTTTGAG I I I I I GAGTCTTAGCTTCTGTGTCTATAAAATGGGGTTCACAC 
AACTCACCTTACAGGGCTGTMGATTAGATTACACAGAAMTATATTTTTTGGCTGTGGG 
GGCTGGAAGTGTTGCTGATTAGCATTTGAAATCCCATCCTGTGGGTGAGAAAACCCCACC 
TTATGACTTGGTGGGAAACAAAGCCAACCTCCCACTGATGAAGCTGAAAGTAGCAGAACC 
TTGCTTCTACTGCCTCCCTTGCAGCTAGAGGCAGGCACAGGACTAGCCTGTCAATTGGAT 
GCAAATGCTCCAGGCCTTGAATCACAACCTGGTGACTTGCACCCAAGTCTATTA 
Sequence 724 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACCGACTTNATCATGCTGAG 
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TTGTGTGCAGCTGCAGCGGATTCCTCTGAGACGCTTGTCTATCGCATCTGCCNGGGCAAG 
TNTCACCTTCCCTGGGATGTNCCCTTGGACAAGANACAAGGGAGGAANGGCCTTAANGAT 
CCTANCTGGGGGGGGA 
Sequence 725 

TAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCGCCCGGGCCGGTACCCATAAAAATTAAA 

AACTATTTTAAAAMTAAATTCCATTTGAGCCACTCCTTCAAACCACCCAGAGTGGGTAG 

ACGTCTTTCGTGCCTCTAAGAAGCCCCATCTCTATTCTGCGTCTCACCTTGCAGGGCTGC 

TCATCTGAATCCTGAAGATGGTGGACACCCATCTGCTAGGACTGAMTGAATAGGACAGA 

GGGAGGTGCAGAGTGAATGGACCATACTACCTGTCATCTTGGCAACGTGTGATTGAATAA 

A ACMC TTCTTTAGMGTTTGATAGAGTGATTTGATMTGTMTTTACMGTGATCATTT 

CI I 1 1 IA 

Sequence 726 

GGAGCTCCCCGCGGTGGCTTTTTGAGTCTGGACAGGNCTCTGTTTTTGCTTTAAAGTTAA 

GAGAGCTAAATAAATGATGGTAAAAAGATAATAAAATAGAACATGAAGGGCTGTCAGTCA 

GTGTAGGTATTTCCATCCCCTCACTTTTCAAGTGAGGTCACGGAGGCTCAGAGCGATAAG 

GAGACTTGTCCAAGGCCACACACCGGCTGGTGCTCAAGCCGGGACTTGAACCCACGCAGT 

CTGACTCTAGAGCCCAAGCTCCTAACTATGACATCCTATTTGATACACTGTTTTACTGGA 

GAAACAGATCATTTGACAGACATTCTTTCTGTTAGCMTTTGACAACTCTTTCCCCAGTT 

GTCTGTACCTGCCCG 

Sequence 727 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTA 

TTCTCTTACAGCTAGGACCCAGCGTTCTGGGCAGAATAGCAGGCAGCGCAGTAGCTACTG 

AGATTATAAGTGGTAGGTTTATGGAGCTGTGACCACAACTCTCCACAAGCCAGTGCTGTC 

TCATGCAAGCACTCTCAGTGTCCAAGTGCTGAGTGTGTGAGTGGTCTGGGCTTTGCAGGG 

TCGGCCAAGCTCTTGGAAGAGCAGGCTTTGTTAGCTGGGGAGTCATCGCTCCATGCAGGC 

CCTGAGAATGGAGCATCCTGAGTGGACTGGTAGAGATGGGGCATGGGTCACTCTGAGGGT 

TTGAGCTACTTCTGCTATTTTTGAAATTCTGGTTTGAACTGCAGGATCGTGCTGAGTTTG 

GCACAGACTMTTTCTCTGTTGGCAGCACATGATGTATCAACTCATGTGTCAGTTGGTTT 

G 

Sequence 728 

CCGGGCAGGTACTACCTTCTCTGCTACAAGTCGAGCGAGGAGCCCCGCATGAGCCCTGAC 

ACCTGTGCCACCATTTTGGAAAAAGCTGGTCTCGATAACTGGGCTCTTGGAAAAACAAAA 

GTGTTC CTTAA GTATTATCACGTGGAGCAGTTAAATCTAATGCGAAAGGAAGCTATTGAC 

AAGCTTATTTTGATTCAAGCTTGTGTCMNAGCATTCTTGTGTTCAAGGAAGGATACCAA 

AAAATACAGGGAGGAAAAGGGAAAGGAAAGCCGCTTATAATAATACCAGTCAGCTTGCAA 

GGAGGGACCACCCTTGTCAGGGAAAACMNAAGAAAAGGAMATTTGGTTTGGACCATTG 

AAAAAACCCCAGCANTTMCCAACCCAATTTCAAAAACTTTCTTGATTCAGGGAAATTTC 

GACTTACCAAGAAAAAACCTTTGGNAAAAATACCCANGGGGGGNTCCTGGGNAAAGGGNA 

GGANGGGAGCCCAAAAAAANAATTTGGANAACCCCCCNANGACCGACCCCCCNGGAAACC 

CCCA 

Sequence 729 

CCGCCCGGGCAGGTACTTTC I I I I I I I I I 1 I I I I I I I 1 1 I I I ACGG AGTCTTGCTCTGTC 

ACCCAGGCTGGAGTGGAATGGTGTGATCTCGGCTCACTGTAACCTTCGCCTCCCAGGTTC 

ACGTGATTCTCCTGCCTCAGCCTCCGGAGTAGCTGGGATTACAGGTGCACACCACCATGC 

CTGGCTMTTTTTTGTATTTTTAGTAGAGACGGGGTTTCACGATGTTGGCCAGGCTC 

TTGAACTCCTGACCTCAAGTGATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTATAG 

GCATGAGCCACCACGCCAGGCCCACTCThrTAAATTTTGACCACCCTGCCTTGAGTGGTCT 

TCTAGCACCCTAACCTCTGTCTAACCTTCGAGAGCTTTGCACTAGCNATTCCTGGGGACC 

AG CTATG GTTGGTATCTTCTCAACTTTCTAA I I 1 11 1 I AA A AAT ATTATTATTATT ATT A 

TTATTTTAAAA 

Sequence 730 




WO 01/070979 



PCT/US01/09126 



TABLE 1 
118/467 



TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGCAGGTACCTGTGTGGAA 

MGMTGCTTGCAMGCTTGTCACCCTCACGAGAATTCCTCTGACAGACATTTGCCTTTG 

ACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGAAACCGAAGAGCATCAGAGGGGACGA 

CTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGAGTTGGTTACCAGAGAAACTGGCGAT 

GGCGATCCCGTGAGCMCATCTCTCAGACCCATTTTAAATGCCGGGGGATACTTAATCAT 

GCTGAAAAACAGCAGAGCCCTGAGGTTTTGGACTACATGTTGCAGAAAGAA 

Sequence 731 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTACACCTGGGACAAA 

TACTTTTTTGTGAAGNCAGGTAAAGCCTTTGCGTGCAATATAGCATCTCTATGCAATGCA 

NCAACTCCTCGTCTATCGCTACAGTAAGAAAACAGCCACGGGTCAGGTGTTGNGGCTCAC 

ACCTGTMTCCCAGCACTTTGGGAGGTCNAGGTGGGTGGATCACTTGAGGCTAGGATTTT 

GAGACCAGCCTGACCAACATGGAGAAACCCCATCTNTACTGAAAATACAAAATTCCCGGG 

TGTGGTGGCNGCATGCCTGTAATCTCAGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTT 

GAATCTGGNAGGCGGNCGTTTGNNGGTGAGCCAAAAATCXaTGCCATTGCACTCCAGCCTG 

GGCAACAAGAGCGAACTTTCGTTTCAAAAAA 

Sequence 732 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTAGTTATTT 

TAAATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGT 

GGAGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAAT 

GACTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGC 

TCATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGA 

GTTGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAA 

AATACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAG 

AATCACCX^TTGAACCCAGGAGGCAGAGGTTTGCAGTGACCCGATATTATGTCACTGCAAG 

TNCAGCCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAAAAAGAAAA 

Sequence 733 

CGACCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCGGGGCACGGTACTGC 

TGGTTCAAGCCATTAMTTACATCACAAAGGTTTGGTTTCTCTGTATATATTTCTCTGGG 

GCACTTTTGCTANGTTGGCTCTATCCTGAGGCAGNCTCTCTCCTCGTGGNAACCAGGTGG 

CTCTAGCAGCCTCAGCTTTATATCTCTCAAGAGTAAGTCCACCGTCACAGAGC 

Sequence 734 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACA 

CCTGGCTAATTAATTTAAAAAAAA 1 1 I I I I I I AGAGATGGGATATCTNTATGTTGCCCAG 

ACTGGTCTCAAACTCCTGGCCTCCAGCAATCCTCCCACCTCACCTCCCAAAGCCCTGGGA 

CTGCMGCATGAGCCACCATGCCCAGCTATATTTTCTGTAAATTGCTAATGANAATGAAA 

CATGTATGCTGTGGACAGMGCCTTGTTGGACCTAGAGCCCATGCTGGGTCCTTTGCCTT 

AATMMCATMCTCTGGCATTACATATATMTTMCAGCCTCAMGANCATGTTTCTTT 

ATTAMCTCTGACTGTTTCAGCATTATTTT 

Sequence 735 

GCGAATTGGAGCTCCCCGCGGGGGCGGCCGCCCGGGCAGGTACTACTGTGTCCTTTAGAT 

CACTCTGCCTTGATCACTCTGTCCCGTCACTCTGCTATTTCACCTGNCAGNGAAATACCT 

GGTATCGTCCTGCCAACGTGAAGCATTGAATGCTTNATACGTCTCCATCCTGATTGTTTA 

GGCTTTGAATGCTGAGAAGTATCTGCACTTTGTTGGTCA 

Sequence 736 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGTACCCTTAAGCCCTGCTCCT 

TTGTAMGTCTTTTTGGATTGTCATCATCAAGAGTCAGTNGATCTCCANC7TCTCAGAAC 

TCACAGGGCACTCTGTCTAGGCATTGCTGACCGTCTGCAGTGTCAGATGGTGACTTCTGT 

ATGTGTTGTGTTTCCCGTTAGACTCTAAGGTTTTTAAAGGCGAGACTCACTCCTGCAGAA 

GCACATMCACAATGCCAMCTCTTATTTACGGAGGTCCTGGCGCATTGTCAGCTTTTGG 

TAAATGCTTTTCTTTTGTTGAATACTTATCTTCTGTGTGCCAAGATTTGTGTTAAGTGCT 

AGAAAAATGTGGGAGGTCACCGCAGACCCTGTTCTCATGGAACTGATGGTGTGTAGTGGG 
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GTGNGGATTAACATAAATAAAATGATGCGCAAATGAACACAAAATTCAAATTGATGATGT 

GTACCTGCCC 

Sequence 737 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTGAGCAGTCGAAG 

TGGATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAANGATCCCAATGACGATGATT 

GGAAAAAACTTCA ATGG CAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGT 

GCAACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCT 

TCMCAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGC 

TCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAA 

GAGATAGTAGGCCACAGTGTTACTGGCTTCCCATAAACACAGCCCTTTCCTGGCTCACAC 

GGGGCATGACCTCCCGCGTACCT 

Sequence 738 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTAGTTGGATGTCGAGGTTNGATTAGATTCT 
GGGGTTGGTTTGCTTGTTTTGGTGGATNGTTT^ 

TCCACCAG AAGGCACATGTGCTTGCCTGTGTC I I I I 1 I GTTATTGTTTTGAGGCAGAGCC 
TCNCTCTGTCTTCCAGGCTGGAATGTAGTGGCACAATCTTGGCTCACTGCAACCTCCACC 
TCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTNCCAAGTAGCTGGGATTACAGGTGTGT 
GCCACCATGCCCAGCTAATTTTTGTATTTTCAGTANANATNGGG I 1 I I I I GCC 
Sequence 739 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGGGGGCAGGTACACTTCCACGA 

GMGAATTMTATTGTAGTGTTAGGAAAACTAGCAATTTAACTAAACAGCATCAAGTTAC 

AAACCAGGAAAGTGATTTAAAACTAMTGCTGGCTTATCTTTCTGAAACAAAGCATCTAA 

ATTTGACAGTCCAAAATGGCACTTATTGAGTGTCCGTGACAATACATGCTGACAAGCAGC 

ACACCTCTTTTTTGT1TTTTTAAGACGGCATCTTGTGCTGTCACCCANGCTG 

Sequence 740 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATATAAAAATTGATTTAGCTTCTACACTCA 

AGTAATTATAAACAGGTTTNTCTTTTGGGACATTTGACAGTTATGTGAAAGGTGAGTCTT 

CGTTGTGTAGTATTGTCTGTTACACTGCAGGTGTCTAGAATTGCTGATAGTGTTCTCCCT 

CTAAAGTAATGTCACCCAACCACTTGTAAATTGACGATAATAAGACAGGAAATCAAGAAC 

CAATATAAATAAGCAAACATTTGAAMTMGAGCTAAAAATCAAAAATAATCTCTCTTTT 

TGCTGATAATACTTTATACCTAAATAACCTAAGA I I II I I I I 1 1 1 I I I i I IGAGACAGAG 

TCTTGGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCCCGGCTCACTGGAACCTCC 

GCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGAAGCTGGGATTACAGGCA 

TGCCCACCGTGCCTGGCTATT 

Sequence 741 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGTATAAATCATGCT 

CCTGCTGTCTAGAGCTTACCACCCAACGAGGGCTTCAGATAAGATCAGCAACTGCCCTAG 

AGTGTGGAACTCCTATGACAAGGTGAGCCTGGGGTGTGATGGAAACACGGCGTGCATGGT 

TACCAAGCCACGCTTCCAGGGAAAGGGGTCCGTGCGGGAAAAACTTCAGAGAGGAAATGA 

CATGTCAGTCAATAACCTGAAAGAACTGGNTGAGAGTTAAGCANCAGGGAACAAGGGCAC 

AGTNNTCCACACAGCTTTTTGGAAAGATCATGTTGNTTATAGTGCAAAAAAATACTGAAT 

ATGGGAAACAATTTGTTATTA I I I I I I AGGAGTNTTGCTTTGTCCCCCAGGCTGGAGTGC 

A 

Sequence 742 

ACTACTATTGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGGGGGAAAA 

AGAGTTACAATTTGCCCATAAAGAATGGAGAGAACAGAAATGTANCTTTTATGCTGAAAA 

ACAAAATGCAAGGGCAATCCAGTTTCTAATTCCTGTGCCAAAGCTGCTGTTCTTGATGAC 

CTCGGTCAAATCATTTAAATTCTCTCAATTTGTTCATATAAAAGTGCTATTAACCTGCAG 

TTCCTTCAAATACTATCCAATCAATGTTGGCTACTTGATTTTCA 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCACAGGGTATGCATAAA 
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CCAMTATACAAGMTTTMTGACAGCATTATTTGCAAACAGTGAAAGATTCTGAGCAAT 

CAAAAGGTTCATTACTACAGGCATAGGTCTATAAATTACTACTGTATGGAATACTATGCA 

GCCATTAAAATAATGAGGAAGAGGGAGAGTGCCCTGTATGCACTGACATGGAAAGGTTTC 

AGTATATGGTAAAAAGCAGCTCATCTTAGAACAGATTTTATAGTATGACCCCACTTGTGT 

GGAAATATTTTGTGATGTGCCACTCAGTGTATACGTATTCATGAGTGCATATACAAGTGT 

GTGAGAAGCAATGTAAGTAACTGTTTCACAAGGACCCCCCCTTTAAGAAGGCAGAGGGGA 

TCGGGGGATATAGAGTGMGGGATGATTTTTGCTTTTTCCTCTA 

Sequence 744 

CCGCGGTGGCGGCCGAGGTACGCGGGAGGAAAGGGCTGTGTTTATGGGAAGCCAGTAACA 
CTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACA I I I I I GGGACTCGGGAATTATGAG 
GTAGAGGTGGAGGCGGAGCCGGATGTNAGAGGTCCTGAAATAGTCACCATGGGGGAAAAT 
GATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCCGATCGCTTTTTGGCCTTGATGATTT 
GAAA 

Sequence 745 

CCGGGCAGGTACACAGTAAGTGAAGGGCCAAGACTGACGGCTGATAGGACAGGGGTGACC 

AGNGGTGGGGAGGGTAGTGGGAGCAGTCCATCCTGGAATCTGGCATTCAAGGGGCGCATT 

GTCTGTGGGAGGATTTAAAAATMTAAMCCAACTAAAGGCAGTCTGCTTTTTATGGTCA 

CCAGGCCGCCAGCAATTCTAAATTTCAGTGATAAAATATTCCTCCTCACTGGACACGAGA 

AGCTGGCTTTCTCCTTATTCCCCAGTACCTTNGGCCCGCTTCTAGAAACTAGGTGGGATC 

CCCCCCGGGGCTGCAGGGAATTTCCGATATTCAAAGCCTTATCCGAATACCCGTCGACCC 

TTTNGANGGGGGGG 

Sequence 746 

CGGTAATACTNGTTATCCACAGCAATCAGGGGGGATTAATCGCAGCNAAAGAACATTGTT 

NAGCAAAAAGGGCCAGTCAACAAGGGCCAGGAACTCGTAAATAAAGGCCCGCGGTTGCTN 

GGNCGTTTTNTCCCATAAGGGCTTCCGCCCCCCCTTGGACGAGGCNTCACCAAAAAATTC 

GACCGCTCAAAGTCANGAAGGTGGCGGAAACNCCGACAGGGACCTATNAAAGGATACCAA 

GGCCGTTTTCCCCCTTGGGAAGGCTCCCCTNNTTGCCGCTCTCCTGTTCCNGACCCTTGC 

TCGCTTACCGGATACCTGTCCCGCCTTTTCTCCCTTTCGG 

Sequence 747 

CCGCGGNGGCGGCCGAGGTACATCTTTGGTGACTTTTCATTCACATTTCATGGATAATTT 

GGGGAGGTGGCCTGCCANCCCTGAAGCCCTACATCCCCATACACACTCTGTGCACATCCA 

GTGCCCTGCTCCACCATGGCAGTGCCCGCAAGGGGGTCCCAGATGAGAAGAAGCTGGCTA 

AAGGGCCCTTGTCCCCTCTCAGACTCCTTCAGCGGGCTGGAGTCCTCCCTCGCTCGATTT 

CGCCCGAGAGCGTTAGGGGTTTCTAMTGCAGGCGCCTTTGTGTTGTAACGMACTTTTA 

GTTTAAGGGAAAATCTCTTTTAAGCCACTGATTGTTCTGACTTGCTGAGTTTACTCAGCA 

GCCTTATGCTGGCTCTGCCACTGCACAATAAAACCAAAGCANGACAGTTGCAGNTNAAGC 

AAG GG GG AAC ATGTTTTG C ATTT 

Sequence 748 

GCCCGGGCATGGTACCTGTGTGGAAAAGAATGCTTGCAAAGCTTGTCACCCTCACGAGAA 

TTCCTGTGACAGACATTTGCCTTTGACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGA 

MCCGAAGAGCATCAGAGGGGACGACTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGA 

GTTGGTTACCAGAGAAACTGGCGATGGCGATCCCGTGAGCAACATCTCTCAGACCCATTT 

TAAAATGCCGGGGGATACTTAATCATGCTGAAAAACAGCAGAGCCCTTGAGGTTTTTGGA 

CTACATGTTGCAGAAAGAAAGAAGNAATTTNTACCTTNNCCNAAAAATAAAAATATNNNA 

NNNNGGTACCTCGGGCCCGGTTTTNAACTAGTGGGATTNCCCCCGGGCTTGAAGGAATTC 

GNTNTTCAAAGCCTTNTTCGATCCCCGTCCNANCCTCNANGGGGGGGGGC 

Sequence 749 

AGGTACI I I ! I I I I M I I I I II I I M I GG NCTAACTGNNNGGAGTATTTCTTTTACCCAA 
GATAAGTAAAAGCTACAACTCTTAGTATAAATATGNGTCCAAGTGCCTNATAACTGCTAA 
CCACAGGGATCCTGAGCTCTNATAGCTTAAACACACAGNGTNNATTTTACTGGTCTACTT 
CTCCTGNAGACCTAAAAGGGCCTATAGCCTCAGTAGTTGACAAAACAACATATTAAAATT 
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CCTCACTGATCACTMCATAACCTAAAATCCCTGCTTTTGACATTAGCATGGNANACATC 

CTTAGCAGGCCTAAATAGAATGGCCTTATAAGTGGATCCAAAGGGC 

Sequence 750 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAAC7TGACGATGGACTCCGTGTTG 

GTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCA 

GGCACCACCAATTA 

Sequence 751 

GTGTCCGGAATCCTACCCGGTGTGNNGACAGTGCCTGATAGTTTCTTCTGCCTTTCTATC 
CCAAAACGATTGGTCAGTTTACCCMGTTTGCMTGCAGTTTANAATCTCCCAGGAACAT 
CTCTTCTAGTAGTTGCCTTAGCCATCTTGTAGTTGATTTGACI I I I 1 1 I I I I I 1 1 GCTNN 
CAGAAAGCTCTATGCTTCATATGGACTTGCATACCAA I MM I GTTCATCTGTTGGTCAT 
GATGGTTAGCAGAGCCTGACCTCCTGTTACAATAGAATGATCGGTTCCTGGGCTACAGAC 
TTGAGTCTG 1 1 M M MGM I M I AAACCTTCCCATGNGGCAATTTGCCATATGCAAAAC 
T 

Sequence 752 

CCGGGCAGGTACGCGGGTGAAAATGGMTAGTTTTCTAATTACAGAAAGAAAAGAAGTTG 

AAGTGGGTTTCGCCATGTTGAGCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCT 

CGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACGCCTGGCCAAAAAT 

CTTATAMTMTCCCCTTCTAATTTCGGCCAGCTTAATCACACACCAAATTCCTTTCATG 

AGATTMTCTTCCACMCTTCTACACTTCCTTAAATCTTTGATTTTGTCCTATACTTCTT 

MM I ATATTAGCMTCTACTTTAGGACAGAAATTTACTTTCCTTTCCTCTTGATTTTGA 

CCAAAGTCCTCTCTTAT 

Sequence 753 

TAGTTTTCTMTTACAGAMGAAAAGAAGTTGMGTGGGTTTCGCCATGTTGAGCAGGCT 

GGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGGGATT 

ACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATTTCGG 

CCAGCTTAATCACACACCAMTTCCTTTCATGAGATTAATCTTCCACAACTTCTACACTT 

CCTTAAATCTTTG ATTTTGTCCTAT ACTTC I M M I I ATATTAGCAATCTACTTTAGGAC 

AGAAATTTACTTTCCTTTCCTCTTGATTTTGACCAAAG 

Sequence 754 

CCGGGCAGGTACCTATATGATGTTGGCCATGCTCACTCACTCCTCCAACCCTCAGTTTAC 

ACATCTGCAAAATGAGATACTTCTTTTCCAGTGTTGCTGTGGACATTAGCAGGCACACAC 

ATTTGGTGCTTGCACAAATGAGGTCCTAAGAGGTGGGTCCCTCTCATCTTACGTGAGGAA 

ACTGAAGCAGATTAGAAATGACCCAAGGAAACCACTCCGAGTTCAGTCTGGAGCCCACTC 

CCCTAGGTTTTAATCATCCCCCAACTCAGTCCCTATCTGCTGAGGTTCCTGGATCCAGAC 

GGTTCTTACCAAGGAACTGTCTGTCCTCACCACATGGATGGTTTTCCTGGCAGAGGTGTG 

CCCTGTGAGGGGGTCA 

Sequence 755 

GCCGAGGTACANACAAGGGGGCNACTGNCATGGGGGNGGNNTCTGGTCTTGTAGTCNGTT 

TGGAATTTTCTAAGTCAGGGTGGGGTGGGGGGACTGTGCACGGGTCATGTGCAGACTGGA 

ACCCATCTCCCCCTCGGTCTGCAAGTTAAAACAATTGGGTTGTCCTTCTCAGCATCTGCC 

AATGTCTCTTANTCAATCTTGGATCAAAAGGGCGTTGGAGGAGGAGGCTGGGAGGGAAAT 

CCAGACAGTTCTCCGCCTCTGACATCAGGTCCAGCTGTTAGCATCGTGCTGTGGGTCCCT 

GAACAAGAAGCAAAGTCAGGACT 

Sequence 756 

AGGTACCGCTGTGTCCGGGTGGGTGGNGNGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 
TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 
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CAACTGGGTTTCCCAAGCTATGTAAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 
CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 
TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 
TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGT 
Sequence 757 

AGGTACCTCTGGATATGTTACACTGAGANAGATACTTCATCACTTACATGATATTGCTCC 

CCCACAAAT7TCATAACCTGAATCTAGTTATAAGGAAATACTATGCAACCCAAATTGAGG 

G ACATTCTGCAAAACAACTACCTGTAATC I I I I I I 1 1 I I 1 1 I II 1 1 GAGACGGAGTCTCA 

CTCTGTCGTCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACTTCTGCCCCCG 

GGGTGCGAGCGATTCTTCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACACGCCA 

CCACGCCCAGCTMTTTTTGTTGGGGTTTCACCATGTTGGCCAGGATGGTCTCCATCTCT 

TGACC 

Sequence 758 

CCGGGCAGGTACTATGGTCCCCGGCAACCTCCCCTTCCTCCTGGGAATGCTCAAATGGGA 
AGGCAGCATGAAACGGTGAACAGGCAATCACTGGACAAAGTCACAAGAACTGGGCTTTAG 
AMTGGTTTTACCATTAGCAGTTGTGACACCTCAGAAGTGGCAACTCTGGATCTNAATAC 
CCTACCCTTNACCCTAAGNANAGGTACCTCCNCNATTTTNNCGGGGGGAAACNTTCTNGG 
GAANTTNCCCTTTCCNAAAAAGGGGGGGGGGGGCTC 1 1 1 I I I I I I I I II I GGGGGGGGGG 
CCCCN NCNN CCCCCCCCN I I II I I M M NTNAAAGGGGGNTTNAANANANATTTCNTTNC 
TCNTTNTTTTNTNANGNNAAAAANTCNGGTNGNGGNGI I II I I I I 1 1 I ANAAAAAAAAAA 

Sequence 759 

GGCGGCCGAGGTACAMGMAGGACTTGATAGCTATTACCTTGCTGCTATGTTTGTTNCT 

TNGNCTCACCAATCATTNTTNTGTATACCTAGCACTGCACCAGGCGCTGAGGTTAGAGAA 

ATAACTAAAACTGCGCCCTTCACCCCTGATGGCAGGATAGGCAAGGTTGGCACCATCGTC 

ACAGCAGGACCCTCATCGATGCCTTGGTGTGTGCCTGGCATGGNGTTTGCAGCAGTTTAT 

CACATNNAATCCTTACAGC 

Sequence 760 

AGGTACTCAGGCCTTACTGGGATTTCCTTTAAGACCTCTGGGAGGAAGTGTCAGTAGCTG 

GGCAGGCCTTQTTGGCAAGCATTCCTCCCTGGGTTGTGGCGGGGGCTCCCGGCCTGCTGT 

GTGGCAGCTGCAGGCTCCTGGGGACCTGAAGGAAAAGCTTAACCGTTCTCCCTTCCCTTG 

CTTGGCACTTAGAGCACTAGTTCCATTCCAGACATACCGATTATCTTGCCTACGTGGCAT 

AGAGGCCTAGGAGCCTCCCTGGGAGGAAGAGGCAGGCCAAGGTCTTGCCTGGCTGCTTTT 

AGGGGGAAAGATGTAGGGAGGAAGCTGCCTTATGCTTGGATCTGCAACGTTTGCCTGGAC 

CTGC GGAG CCTATTTTGGCCAGGGGGAGGGAGACAGAAATTANACCCNANGTATTNAGGT 

AATCCTTTTNTTGCCTTTGMCATTGCNCGGGNGTACTTTGNAAAANAAAAAAAAAAA 

Sequence 761 

CTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 

GGMTAGTTTTCTAATTACAGMAGAAAAGAAGTTGGAGNGGGTTTCGCCATGTTGAGCA 

GGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATT 

TCGGCCAGCTTMTCACACACCAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTAC 

ACTTCCTTAAATCTTTGATTTTGTCCTATACTTC I I I I I I I ATATTAGCAATCTACTTTA 

GGACAGMATTTACTTTCCTTTCCTCTTGATTTTGACCAAAGTCCTCTCTTATGCAAAAT 

G AAAAATTACTCTTTTTTCMCTTrCTTTACCAAAAATACATCCTCATAAC I I I I M I CC 

ATCTCTCCTACTTACTGG 

Sequence 762 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 
GGMTAGTTTTCTAATTACAGAAAGAAAAGAAGGGGGNGTGGGNTTNTGCCATGTTGAGC 
ANGCTGGNCTCGAACTCCTGACCTCAGGNGATCAGCTCGCCTNAGCCTCCNAAAGTGCTG 
GGATTACAGGCATGAGCCACCACGCCTGGCCAAAAATNTTATNAATAATCCCCTTCTAAT 
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TNCGGNCANCTTAATCACACACCAAATTCCTTTCATGAGATTAAACTNNCACAANTNCTA 
CACTTNCTTAAATNTTTGATNNTGNCCTATACT^ 

AGGACAGAMTTTACTTTCCTTTCCTCTTGNTTTTGACCAANGTNCTlvrrCTTNTC 

TGNANAATNNCTNTTTTTTCAACTTTCTTTACCAAAA 

Sequence 763 

TTAGGGCGATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATGTAATGCTCCTGAACTGT 

ATGCTNGACACGGCTGTCTACNTAGGTTTTGTTCTGTGTATTTTATGACT AI I 1 1 I 1 1 AA 

AAAGTAAACAAAAAAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAA 

AGCAGGTGAGTGGTTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCAGAATGGCC 

ATGGGCTTTGCCTTCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGT 

TC AACA CTGCCAGTGTAGTTAATGTCACTGAATTATACACTTTAAATGGCTAACATGACC 

MTTTTATGTTATATATATTTTTACTACCACAAAAAAACTAGCTGGCACCTAAAAACATTC 

CATT 

Sequence 764 

CCGCGGTGGCGGCCGAGGTACTTGGATGGGTTTTGTGTGTATGTTTGTGTGTGCACTNGC 
GTCCACCCTGTTGGGCTTAGTGMCTTTTTGATTCAGTGATTTAAAGTTTCTCATCAGAT 
TTGGAAAATTCTCAATTAC I 1 1 11 1 CTTTAAATATTTCTCTTGCCCTTCCCCTCTCTCTT 
CTTCCAGGATTCCAATTTCATCGATGTT 
Sequence 765 

TACTTAGGGCGAATTGGAGCTCCX^CGCGGTGGCGGCCGAGGTACGCGGGGGGTTCAAGCG 

ATTCTCATGCCTCAGCCTCCCAAGTAGCTGGGACTAAAGGTATCCACCACCACGCCTGGC 

TMTTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGATCAGGCTGGTCTCGAAC 

TCCGGCCTCAAGTGATCTGCTCGCCTTGGCCTCCCAAAAGTGCTGGGATTACAGGCATGA 

GCAGCTGTGCCCAGCTGGATAATTATTTAATAAATTGGGGAGCATAGGAAGCATAGTATT 

TGTGMGTGGGTAGGCAGGTGTGATGGGGGTAGGTGATGTTACATTTGGGGCATTTTGAA 

GTTG GTGGTTCTTCTGAGTTGAGCAGTCAGTCACTCTTCATTTGCTGCACC7TTATCTCA 

TTTTAGCCAACAGACATTGAATACCTACCAAGTCTTAGGTATTTGCA 

Sequence 766 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAAA 

GAAAACAAATACCAAGTATTTACAGATCCAGAGAAAGTTCACAAGAATGGGAGGATGCCA 

GTTCCAATGCTTTGTAAAGTCAAAAATAGCCACATTGCAAAACAAACAAAAAAAAACGAG 

AACGTTCCCGAGTGTGCCTCCAAAACATAAAGGAGAAAATCATACAGAAAAACCTCATGT 

AAGGGTTGGAACTTGAGCAACCAGCTATCCAAATACAGAGGGGAATCCTCGCTTAGCTAG 

GGCATGGCCTGAG 

Sequence 767 

AGGTACACACAGTGATTTGGGGTCCTTTTTCCTAMACAGCTTCTTTATCAGGACTTTGG 

MTTCTGGGTGAGATAGAMCACTGAMACAGGGCGGAAGTTTTTTCTTCTGGCTTCTTA 

GTCCATGGAGGGCTCAGCGTGGAGAGGATATGCCGTGGCATTCTCCCTGGGAGACCACAC 

ATGTTCCCGACAGCTCAGACCCCAGACCCGCACATGCTTCTTGACAGTTNAAACCCCAAA 

CCGNAGGNGCTCCCGACAGNTNAAACCCCANACCCCGCGTACCTGCCCG 

Sequence 768 

CCGCGGTGGCGGCCGAGGTACTTAATAATTCATAATTTAGCCATGATAGTATCTAAGCTC 

ACTTTCAGAATTATTGCATACATGCCTTAGGGMGMCCTATCCACTAATGCTTTTAATA 

ACTTACATAGATTGTGTTGCGGCMGTCMGTTTTMTATAGAGGAAAGGGTTTATCTTA 

TCATAGTAAAATAGTAGTGATGTGTTCATMTTTACTATTTGCATGGTATATTATCAAGG 

CTGTAMAGCTTGMTTTGCCTTTTCCACATCTTCATTTCAMTTMTTTT^ 

CCAGAGAAGTGGGTAGAACCCAAATGCCCATGNGGGT 

Sequence 769 

GGCANGGTACAAATTCAGGGGAGGATGGAGCAGCTGCAAGCCTGGCTGGCATCCCATGCC 

AACAAGGTGACCCAGTAGATAAGTGACAAGGTGACTGAGCTGCCTGGTGCTCTTGCATAG 

AATTTTCMGTGTTAGAAAAATGTCTCCATGCCTTGCAATTTGTCTCTTGTGGCCAAGCC 
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TAACTCAGATGGAAATGCAGAAATCACCCGTCTTCTGCGTCGCTCACGCTGGGAGCTGTA 
GACCGGAGCTTGTTCTAATTNGGCNATTTGGGTTCNn"CCCCCCCGGGNNCNTN 
Sequence 770 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI I 1 1 I 1 i 1 I 

I I I 1 1 I IS I I I IGAGGNGGAGTCTCACTCTGTTGACCAGGCTGGAGTGCAGTGGCACAAT 

CTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAACGGGTCTCCTGCCTCAGCCTCCCA 

AGTAGCTGGGACTACAGGCGCATGGTGCCACTCCCGGCTAGTATTTTGTATTTTAGTAGA 

GACGGGGGTTTTACTGTGTTGTCCAGGCTGGACTCGAACTCCTGAGCTNAGGCAATCCAC 

CAGCCTCAGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCGTGCCTGGCCTCTTT 

CATAI 1 1 I I I 1 1 ACACTTTTCATTTCTTCTTATTTTAAGTGNGCTGGATAGGGGCTCCAG 

MCAGMTTCAATAGAAAGTTGTGACAGTAGGMCCCTTATCTTGGTCCCTGATTTTAAA 

GGAGGGTTNAAAAAAAACCCCCCCCAA 

Sequence 771 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACI I N I 1 1 1 I I 

I 1 1 I I 1 1 I I IGGAGAGAGGGGGTTTCACTATGTTGCCTAGGCTGGTCTTGAACTCCTGGC 

CTCAAGAGATCCTCTTGCCTCATCCTTCCAAAGTGCTGGGATTACAAGCGGGAGCCACTG 

TGCCTGGCCTAGMGATCTGTTTTCTTTTCTCTGAATAATTCTTGTGACACTGTCTCTCC 

CTCCATCTCTrTCTGTTTCTTTGTCATTTTTCCCAGCTATC C t 1 1 I I 1 CCTTGNCTTGTC 

CTCTTCTCCCCTCCATCCTAAMCCTTTGATCACMGCTAGTTTCCTTTCCACATCATCT 

GCTCCCCTCTACTAAACGCTATTTCGCCCCCACCTGCTTTCAAGCTGNGCTTGCCTCTGA 

GCCCCTCTTTCACCACGGCCCAAA 

Sequence 772 

NCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCATGCTAGACAA 

CCTTATGACTTGAAAACAAAATAATATTTGAAATGGAAATGGCCTCAGTTCCACCCCTGG 

TGCCCACATAGCATAGTGAACCTGCCCCTGCAGCATTGCCCATGAGTGCTAAGATCCTGT 

GCCCATTTGCATGTCTTCCTTAAACAAAAGACCGCCTTAGTAAGAAATTAGTAAACCAGG 

GAGATAATCMCTTATCCCCCAAMGATTTMGCCTCTCATTTTGTTTAACCTTCATTGG 

GGATTTTAAATAGAAAACTAGGGCCCGGCAGGGTGGCACATGCCTGTAATTCCAGTACCT 

GCCCGGCCGCTCTAGAACTAG 

Sequence 773 , 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGG 

ACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTTCGCT 

TCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCT 

CCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCT 

CATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGT 

CATCAGTTTATCAACCAACAAGCAG 

Sequence 774 

AGGTACGCGGGGAGTGAACGCTCTCGGAGAACCCTTTCCCACGAACGTCCACTTCAAAAG 

AACGCGACGGAGCATTAACTCTGCCACTGACCCCTGGCCTGCCTTCGCCTCCTCCTCTTC 

CTCCTCTACCTCCTCCCAGGCGCATTACCGCCTCTCTGCCTTCGGCCAGCAGTTTCTATT 

TAATCTCACCGCCAATGCCGGATTTATCGCTCCACTGTTCACTGTCACCCTCCTCGGGAC 

GCCCGGGGTGMTCANACCAAGTTTTATTNCGAAGAGGAAGCGGAACTCAAGCACTGTTT 

CTACAAAGGCTATGTCAATACCAACTCCGAGCACACGGCCGCTCTAGAACTAGTG 

Sequence 775 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTACTGGCATAAGAGTAAATTGGTGAGAACT 

TTCTGGAGGGGTAGTTTGGCAATGTGTTTCCAAAAAATCTAAAAATTATATTTGCCTCTA 

ATCCAGCAATTATACCTCTAGAAATTAATACTAAGGAAAATCTTAAGAATATACCGTAAA 

ACTTTAGTTGTMGAAATTriTTTGTGGCCAGGCATGGTGGCTCACACCTGTAATCCCAG 

ACTTTGGGAGACCAAGGTGGGCGGATCTC CTGAC CTCATGATCCACCCGCCTCGACCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCAAATTTTAAATAAGAAGAAACAGTCAACAGCAT 

CAGACATAGTAGGTATGTCCAACACCATAATGGCTGAAAAGTGCCCCCTAGTCTGGCAAT 
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TAGTAGGTCATTGGTTTATTAATAACCGGCATGTTAAAGTTG 
Sequence 776 

CCGCGGTGGCGGCCGGCAGGTACAMTCATACCTCCCMGGTATTGCTCCATTGTGTTTT 

TGTGCATTTGGTTTGGATTTTTATGGGGAATTGAAGACAAGTGGATCATAAAGTGCAAAA 

TAAAATGCTCTAGAAATGACAGATGGGGCACAATTTCCAAGAAAATTCATCTAGACAGTG 

GCAACACTGAGAAAAAAAAGAAACATTCAAGAAG 

Sequence 777 

GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACI 1 1 I I I I 1 1 1 I I I I I 1 1 1 1 

TGCTAGAGATGGGGTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGGCCACAAGTGA 

TCCACCTGTCTCAGCCCCCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACTCCTGGC 

CCATGTTTAGGATTTATACCAATATTATTAACTTAGAAATAAGTTTCTMTAAATTATTC 

CACCCGMCTTAGGGTAACTGAATTTTAATGCTGATGTATTAAGCAGGTTCTTCCTGGGG 

TCTTTTGATTCTCAAGGGATCCTTCACTGNGGGTGGACTTCAAATTAATAGGAAGCAGGA 

AGGAGCCACCTGCACTGTTTTCTTGACTGGGGATGACACCNAAACCTT 

Sequence 778 

CCGCGGTGGCGGCCGAGGTACTATGAGAATTTCAAACAAAGAATGAAGCCATAAAACAAA 

MGACTGMTATTTGGCTCTGCCTGGCTCCCAGGCTTTCTACTATTCTTGTGACTTGGCC 

TCMCAAMTCTAAAGTGACTTGTTATTTGTGGGTCAGCTTTGTCCCATCCTTACCAGTC 

ATGGCTTTAGACAAAAGACTCAGCACCACTCACCCTNTGGGACAGTNTGACTGNGGTCTG 

AGNCCCCTGCTTANATATTAGGCTTAAGCTCAGTT 

Sequence 779 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAGAATTTCA 

AACAAAGAATGAAGCCATAAAACAAAAAGACTGAATATTTGGCTCTGCCTGGCTCCCAGG 

CTTTCTACTATTCTTGTGACTTGGCCTCAACAAAATCTAAAGTGACTTGTTATTTGTGGG 

TCAGCTTTGTCCCATCCTTACCAGTCATGGCTTTAGACAAAAGACTCAGCACCACTCACC 

CTCTGGGACAGTCTGACTGTGGTCTGAGGCCCCTTGCTTAGATATTAGGCTTCAGCTCAG 

TTCC 

Sequence 780 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACAGTC 

AGCATGCGCTGTAGCAATGTGCTTTGCAiGCTGGAACTCCTATCAAGCATCCTAGGCAAGG 

CATGCACCCCAGCGCCAGAGAGAATCAGGAAGGGGAAGGTGCCCTGAACCTCAGACAAGA 

ACCCCTTCCAGAAACCACCACCAAAGCCATCACTGTGTTTCCACCCTCAGACCTGTGTCT 

CTTTAGCTTCTTGGTAGAAGGAAAGAAGAGGAGCTTGGGTGGGGCAG 

Sequence 781 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCATGGACAT 

GCACCCCGGCCACAGAGAGGCGCACAACTTNTACAGAACACCCTTCCACCTGGTCTTCCA 

CAGCTGCATCAGATTCCTGGACTGTCACAGACATGACTTCAAACTTGAAAGTTGCAAGAT 

CTCCTGGAACAATTTCCACMTGCATACMCTTCATTCTTAGCCTCAAGCACTGAATTAG 

ACTCCATGTCTACTCCCCATGGCCGTATAACTGTCATTGGAACCAGCCTGGNCACTCCAT 

CCTNTGGANNNTTAAACNTTNAAMANNNAMNCNNNCCCCTNGGCNTTTTTAAAAANNN 

GGNNNCCCCCGGGNNNGGGAAA I I I 1 1 1 < I I AAANAI I 1 1 I H I CCCCCCCCCCCCNGGG 

GGGGGGGGCCCCCCCCCCCCCCI 1 1 1 1 1 1 

Sequence 782 

NAATTGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTCCCTGAGCAGTCGAAGTGG 

ATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGA 

AAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCA 

ACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCA 

ACAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGAC 

Sequence 783 

CNATTGAGCTCCCCGCGGTGGCGGCCGAGGNCTGATGTCCTACAGTCCTCTACCTGATCT 
ACGTTCACTGGAAAGTGTNGAGTCTCAGCAGGAAGCACCTTGCTCTCGTGTCCGGCTAAT 
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TCGAGTGCTTTACGTMGTAGAGGAATTGCTGACTTTTGGGACATTTCTGGTCTTGCCAA 
AGTTCACCTTGTAGTAAAGCCCCCAAAGATACTTCCCAAATAGATGCTCTCTTGAAAATA 
ACTCAG 
Sequence 784 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCGGCCTATTTTA 

TATTCTGTTTAGTGCTTTCAACCCCGAMTCCACCTTCTAACATTAATATTGATATCCAT 

CCTCTCTCTCCTCCMCCTCTCTCTCTCTTGCATTTACTTTTAGATTTTCATTACTTTCT 

TTTTATTCTG ATTCTTGTAAATAGTATAAACTAG ATTCTTTTA I I I I I ATTTACTTTTTA 

MTTTATGATTGACACATMTMTTGTATATATTTATGGGGCACAACGTGATGTTTCGGT 

GCATGTATACATAGTATMTGATCAAATTAGGGTAGTTACCATATCCATTACTTTAAACA 

TTTATCATTTCTTTGTGGTAACAACATTAAAAATCTCATCTAGAATGGCGTGAACCCTGG 

AGGCAGAGCTTGCAGTGAGACGAGATGGGCGCCACTGCCTCCAGACTGGGCGGACGAGCG 

AGACCTCCCNTCTCAAAAAAAAAAAAAAGG 

Sequence 785 

GCTCCCCGCGGTGGCGGCCGGCAGGTACGAAATGAGAGAAATGGTTTAGTAAACGTATAA 

GACATCAACATAGNAAAGTATTCTATAGGNNTATGTGTTGGAATTACAAAGATGAAGAAA 

AGATACAGGCAAGTATTTGATATACTNAATTAAAAATAGCAAGATGTAGAGTAGNCATGT 

ATACAGTGATAGCAAGAACATGGATCCTTAAGGACAAAACTGAAACATAATGCAAAAAAA 

GAAAAATATGCAMTTATTTTTCGTATGATGTAAGTTGTAAATAT 

Sequence 786 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGTACNCGGGCCTATTTCTGAA 

TAACTCAGNGGCTTAAAATTATATCCCAAAGTAGNGGTATCACAGGGTTTCCTGATGAGG 

ATAAATGGGCCTGAAGTGCTTATGGGCACCCACTATGTATCATGGNAAAACTTGCACGTG 

TGTGTGTGTGTTGAGAGAGAGAGAAAAAANAATAGANAAAGTTGGTGAGAAAAGGGNAGG 

CTGI I I I I I GNNCCGAGGGNTGTNTGGTTGGGCTTT 

Sequence 787 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGNACACANTAAGTGAAGGGCC 

AAGACTGACGGCTGATAGGACAGGGGTGACCAGTGGTGGGGAGGGTAGTGGGAGCAGTCC 

ATCCTGGAATCTGGCATTCAAGGGGCGCATTGTCTGTGGAGGATTTAAAAATAATAAAAC 

CAACTAAAGGCAGTCTGCTTTTTATGGTCACCAGGCGCCAGCAATTCTAAATTTCAGTGA 

TAAMTATTCCTCCTCACTGGACACGAGAAGCTGGCTTNTCCTATTCCCCAGTACCTGCC 

CG 

Sequence 788 

GGNGGCGGCCGCCCGGTTTTGGACGCGGGTNTNTGCCCTNACI I I I I I AGCGG AGCAG AG 

GAAACATTCATAAGGAAATATGCGAGTAGAGCTCAGGAGAAAAGCAGGACTAGAGGCCCA 

AGAATCACAGGCCAGAAGAAGAAGCTGTAGCCTCGGGAATGGAAGAGCTCTCTGAAGGGG 

AAAGGGGAGAACAGGAATGTNCCAGGAGCCAAGGCTCATCTATAAGGGACTTNCACATTT 

AGGATGTAGAAGAAGGAAGCAGAAGCAGGGGATGACCAGAAATGGCCCCAGAGATGAGAT 

GMAGTTAGGAGAGCGGNGAGCAAGCCTTTAGGTTTCACAAGGGAAGGAGGGAAAGTAGG 

TGTTAGGTGCTGCCMGATCAGGGAMATMGCAGAAGACCAGGCCATTTTNANTTGCNG 

TGG 

Sequence 789 

CCGCGGTGGCGGCCGAGGTACCACAATCAACTCAATCACAACATATTACAACAAAACCCT 

TCATCI I I I I CCTTAACCCACTGTAACACAAAGCAG AGAATACAGATTAGC 1 1 I 1 1 I ATT 

TGTCTGTTTGACTTCATCTCTTACATACCTCTATTTCAGTATCTATGATATTTTCCTCTT 

CTTATCTGTTCMTGACAGTCTTCCCTTTTAATATTCTAAACTTGTCAAGCACAGCANTT 

MMAAGTATTCTCATGTATATATTTTATCTTTAGAGCATCGCATAAAGNCTGATACATA 

GGMGTTTTAGATGCATATTTACATTGGGTAGATGAATCCAGGGGAAAAG 

Sequence 790 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCAAGAGAGACGTCTCTTACTGCCTCATT 
AAGCATTTGGAGCTGTTAAACACAAATCAAGGCAACCAGAAAGGGCATCTTGGCTTCAGG 
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CTGGGCATAACCATCCCATTTGCCACATAAAAGTCTAGTGGCTACTCTGCACCCTTTCTG 
GGTAGAAGCAGAGTTAGTTTGGTCATGGGGGCCCCTGTGGGACAGTGTTGCCCAGACAGG 
TACCTCGGCCGCTCTAGAACTAG 
Sequence 791 

AACCCACTATAGGGNTNATNGGAGCTCCCCGCGGTGGCGGCCGCCCTGGCAGGTACTGNC 

TGTCTCAMTTTTTGGATACGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGAT 

CCTAACAGGTGATGACTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCT 

CAATGTGCCTAGATTGAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGG 

TGTGGTGTAAACACCTACATTTTAATGTAAACATTTCAGTGTAATCAATGAGAACTATCA 

TTCTGCTTTAATCACCATGAGTTCTGAMTMCAAAGGATTTGTCTGACATTCATTCTAA 

GMATTCATTCTTACCTGACTMGAMCTTTTTAACCTGGCACAATAATAAGAAATGACC 

TGGNAAGTACCTCGGCCGCTCTAAAACTAAGGTGGGATCCCCC 

Sequence 792 

NGGCGAATTGGAGCTNCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAAAATGTAA 

GTTGGTAAAAGACATTGGACTCCAGCTATGTTCTTTAGAAAGAAGGTATTGGACTCTGGC 

CATGTTCTTCAGAMGAACATGCCTGGCTTTTTCACGATTTGATCAGTCTTCTTAGACCC 

TGMCCCCACCATGAMTGGCTTCCCCAGACACAACCCGAGAGAG7TATGCTTTGTTTCT 

CAGCTAAAATATTTTGCAGATCTTAATTTCCTGGGTCATTGCATC A I I I I I 1 1 I I I I N I 

T 

Sequence 793 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCC(XiGGCAGGTACCMGTATACCTTTT 
CATTTAMTCATTGMCAGTTCACAATGGCTGTTGTAAAGTTTTTGCCTTGTTACTGAAA 
CC ATCAT CTCTTGGTCTATTTCTATGG AGTTA I INI I CCTAGTTATTGGCAGTTTGCTT 
ATCTTTTATGTGTCTMTAACTTTTAATTGAATGTAGAACATTATGGATCTTATA I I I I I 
GAGMTCTGAATGmAAMTTCCmGMGAGTTTGTGTmCTTCTGGAGGCAGTTTA 
CTTACTGGCTGTGAGCTCAGTCCTGTTGAGGCTGTTTNGCGACNNNCTGTGGCTTTGTCA 
GGGTGGGGTGGTGGATCAGACATCTGGTCATCAAC 
Sequence 794 

ACNCCCGGCCAAAAGGGAGNNCACAGGGGGGGGCCAATATATAAGGGGGGCAAGGAGGGG 

GGGNNGGGNAAAGANGNCAAACCCCNNCCCNNGGGGNNGGGGGGGGAGGGGNAANAAAGA 

AGGNGGGGGNiSNCCANGCAACCNAAAGGGACNAACCCAACANNNAAGCGGGGGAAGGGGG 

CACGANAGAANANANANNANAGGNGANNCAAGAACCAANCAAGGGNGGGGGGGGAAGGGC 

NNCAAANGGGNNNAAAAANNGGCNCAGGNNACCNAANGGGGGGNNGGGGCNNAGAANGGG 

GGNGGGGGNNANGGAAANNAAANNCNGGANNNAACCAAGGGGAAGGGAGGGGGNGGGAAA 

NCAAGGCCCCCGGGGNCAAAACAAAAAGNGGGGGGAAAANACAANCGGGGNNNGGNGNCA 

GGAGGGGAGGNNANNAGCCGGCGCGNAAAAAAAAAAAACAANCCAGGGGCAANGGGGNNG 

GGGGNGGCNAANGCCNCNNGGGGNAAANCCCNANGGGNGNACCCNNGGAGGGNGGGGNNN 

GGGGGGGCNCNGAAGGGAAAAANAACCCCCAAGGGAAAANCCCANGGAANGGGNAANAAG 

GGGGGNGGCAAGNGGGNANCCCGNAAAAANAAAGGGGAAACNGGGGGANNGNCCCAAAAC 

CCCNGG 

Sequence 795 

CCGCGGTGGCGGCCGAGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAA 

CTAAGAAAACACCAACCATTCAACAGTATACTAGAATTCCTTTCAATGCATAATAGAAAC 

MGMGGGATTAGAAAAGCATGTCATAATTTCCAGATAGCATAATTATTTACATTAAAGA 

TCCMGAGACTCAGACCTAGTAAMGATTTTGGCCACATTGTGACATTTGAGATCACATT 

AAAAAAAAAAAGGAAAAATCAAGNGATACTAACATCACCAATTAACATCATNAATTAGAA 

MTTTATCCATCAGCNGTATTNNNNNGGGCTCAATGCNTTGTAATGCCGCATTTCGGGAG 

GC 

Sequence 796 

CCCCCCCCGGAGTTTCAGCGAAAANCCCGGCCGAGGNACCNCNCCNTTACNCCCAGGNTT 
ANAAACNCCCNGGTNTNGNCCCAGGAGAGGCGGGGGANCACCGCGCCNGGCCAGCANGGN 
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CCCCTNCTAAATNNNGNNGTNNAAGGAANCNGGACCCCAAANNCNCGCCCCCCACGAAAG 

GAGCCTGGGAACTACCAGGGCGTGAGCTCACCGNGCCCGGCCGCTCTAGAACGAGAAGGA 

ACCCCCGGGCTGCAGGAAAAACNATAACAAGCNAANCNAAACCCGACCACCCCAAGGGGG 

GGGCCCGGGACCCAGCTGTTTGGGCCCTCAANACNAGGGAAAAATTGCGCGCTAAGNGCN 

CAANCANGGGCCAAAANGCTGGNTNCCCCGGAGGGGAAAAAAGGACACTCCCGCGNCACA 

AATTACCACNCAAAACATAACNGAAGCCGGNNAANCAATAAAANNGGGGAAAAANACCCC 

NGGGGGTGGCCCTAAAAGGAGGGGGAGCCCCAACCACCAACAAAATAAATTGGGGG 

Sequence 797 

AGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAACTAAGAAAACACCAA 

CCATTCMCAGTATACTAGMTTCCTTTCAATGCATAATAGAAACAAGAAGGGATTAGAA 

MGCATGTCATAATTTCCAGATAGCATAATTATTTACATTAAAGATCCAAGAGACTCAGA 

CCTAGTAAAAGATTTTGGCCACATTGTGACATTTGAGATCACATTAAAAAAAAAAAGGAA 

AAATCAAGTGATACTAACATCACCAATTAACATCATCAATTAGAAAATTTATCCATCAGC 

CGGGTGTGGG 

Sequence 798 

GCGGCCGAGGTACAATTCAACAATTNNTGGTCCAGGATCATGAATGGGCCATTNNTAGTT 

CTGTGTGTGCTTAAACACATTTTTTTGTGGGGTGCTGTGGATGTGTGGATGTAGCCAAAA 

AAAACCCTATTGTGGGNTNGGTCCTGGGGCAGAAAGTCTGGTGCCAGAGAGTGGGGTTCT 

GGGGGTCTGTCTTCATAGTTTGGGGTAGCACTAAMTCCTGTGAGCCTTTCTGGGCCTTG 

GTMCCTCCCCTGTAAGTTAGCTGTTAGATAATTCAGCTGGGTAGCATTTTATACCTGGA 

TGATGTTCTAAAGTCCAGCCACANAAGGCCNNNGTCTGGCAGAGTGAGAATTNCCTTTGA 

AGAACCTTNAAACTGNTNCCCNAGAGTGACACAGGGGNNCCTNGGGGAAAANCNAAAAAG 

NNNTTGGGAATTCTNTNCAAAAGNAAGNCCCA I I II I I II I GCCN NN ATTTNGGCCCNCG 

NTAATNCCCCNCCCCAAGNAAAAANMAAAAAAANTN I II I I I I II I 1 1 I 

Sequence 799 

CGGTGGCGGCCGCCCGGGCTTGGTACCCTCTGTCACGGCTTCCCTTTGCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCAGANATTGTTGCCTCCTGAA 

Sequence 800 

GGGCGNTTTGGAGCTCCCCNCGGTGGCGGCCGGGCAGGTACTATCTGGAACNTGTAGCTT 

CCTTTNGCACTGCAGCATGGGMGCCAGAGTTGATGATTCATACACCAGCATTTACATTT 

TTCAGCATGAAAGTGGTATGTTCTTCAACTCACAACCCATTGGCCAGAACCAGTAACATG 

ACTTCACCTAACTGCAAACTAGCTGGAGAATTGTGGGAGAGCTCATGG 

Sequence 801 

CCGCGGTGGCGGCCGAGGTACCATTTAGCACACAATTTCCATGTCCCAAAAGCAACCCCC 
AT AAAC AGTG ACTA 1 1 I II I ATGCTGTTTTTCTTTGCCCCMCACTTTTATCATTTGATA 
TGTTATATCTTGC I I I I I I 1 1 I I ICI I I I I I AATGGAGTCTCACTCTGTCACCCAGGCTG 
CAGTGCAGTGGCGCGATCTTGGCTCACTGNAACCTCTGCCTCCTGGGTTCMGCAATTCT 
CCTGGGGGGTGGGGAGGTTTGCAAGTGATCCAAGATTGCGGCTCTTCACTCCAGACTGGG 
NGAAAGAACGAAACTCCATCTNAAAAAAAAAAAAAAAAANAAAGTACCTNCCCGGGCCGG 
CCGCTCTANAACTAGGTG 
Sequence 802 

CCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATGGTGTCAACTTATGTCAGGACCCATG 

GGCCCTCCCCATGCACACAGCACTCTTGGAATCTCATCCTTTTCCATGGCTCTGGCTCAC 

ACTTCCACAGCATTTACTCCTAAATATGCCCCCTGGGTTCAAGGGTGATTCTCGTGCCTC 

AGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGCCTCGTAI HUM I GTGT 

GTGTGTTTAGTAGAGACGTGTTTCACTATGTTGTCCGGGCTGATCTCCAACTCCTGTAGT 

CAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTATAGACATGAGCCACCACA 

CCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAAGTGTGGGTAGTCATTAGTGC 

TGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGCACTTCCCGTCCACT 
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Sequence 803 

AACGACCGCCCGGGCAGGTACTAAGCCAGGACCTCAGTTAGCACTAAGCACTCTTACTAT 
TGCCCCCACCTGGCACAAAGCAAACTGAGATCTTAGTTGGGCCCATCATGTGTCATCTGA 
TTGTCTTAG AAGTTC 1 1 I I i I I CTMGACAGAGTTTTGCTCTTTTGGCTCAGGCTGGAGT 
CCAATGGCACAATCTCGGCTTACTGCAACCTCCGCCTCCCAGGTTAAAAGCGATCCTCCC 
GCCTCAGCCCTNCGAGTAGCCGGGACCACAGGCACCCGCCACCACGCCCGGNTAACCTT 
Sequence 804 

TACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTCTCTAT 

GACTATCMGCTCAGGCCTCTCCCTTTTnTAAACCAAAGTCTGGCAACCAAGAGCAGCA 

GCTCCATGGCCTCCTTGCCCCAGATCAGCCTGGGTCAGGGGACATAGTGTCATTGTTTGG 

AAACTGCAGACCACAAGGTGCGGGTCTATCCCACTTCCTAGTGCTCCCCACATTCCCCAT 

CAGGGCTTCCTCACGTGGACAGGTGTGCTAGTCCAGGCAGTTCACTTGCAGTTTCCTTGT 

CCTCATGCTTCGGGGATGGGAGCCACGCCTGAACTAGAGTTCAGGCTGGATACATGTGCT 

CACCTGCTGCTCTTGTCTTCCTAAGAGACAGAGAGTGGGGCAGATGGAGGAGAAGAAAGT 

GAGGMTGAAGTAGCATAGCATTCTGCCAMAGGGGCCCCAG^CTTAATTTAAGCAAA 

CTAAGAAG 

Sequence 805 

CCGGGCAGGTACAATGGACTTTGACAGTTCTTCCCAAACAGATCCTAATTTTAAACATTA 

GGTTTGCTTTGATTCTTTTCCTTGGGGCTMGAGCTCACAMGACTTAGGTTCTGGTCAT 

GGCTCCAGAGGCCACACATTCCAGGACAAAGTCTCTCTACAGTCAACGCCTTAGTCCCAC 

ATCTGTAAAATCGGAATAATCATCCCTGATCCAGCTATCACATTGCAGTAGAGTGAGACT 

CAAATGAGATAATGGAAGACAGTGGGAATGATCATTTCCAACTTGGCCTGGCTGACCCAT 

TCCTTGTTCTAAAGTCAGCTCAGGTTTCACCTCTTCCAGNGAAGTTGACCTGGCACTTTC 

TTTTAGGATGGCTACTGCTCCTCTGGGTGCCCCGGGGCTCANTGTCTCCCCATCACCGCC 

CATGGCACACTTGGAGTGACTGGTCCTTTACTTTGNTT 

Sequence 806 

TNCGGGCAAGGTACATTGGCCCCAAAGAGNAGGAATTCCTTGTAGAGGAGCTTGTAGATG 

CTTNCCCTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTG 

AGAGTGTATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCC 

CCAGGGCTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCACACACCCCA 

GCCTTCACCCTGCGTCAGTGGACAAGGGGGTAGGAGCCTGCAGAGCAGAAAAGTACCT 

Sequence 807 

AGCNCCACCGCGGTGGCGGCCGANGTACGCGGGATATGTAGAACTTCACNNGTTTGAAGT 

TGGCTGATTAAATATACTAAGTATTACTGMTCACTGCCCTGCCTTTTCTGCTTTCTTTA 

CAG ACCTGTTTAGTATACACTGTATGTA I I I I I I I I I I I I I 1 1 I G AG ACTCCGTCTCAAA 

AAAGAGAAAATTATGGGCCGGGCACAGTGGTTCATGCCTGTMTTCCAGCATTTTGGGAG 

GCCGAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACTAGCTTGGCCAACATGACGA 

AACCCTGTNTGTACCTGCCCG 

Sequence 808 

CCGCGGTGGCGGCCGAGGTACGAGACTTGTCACCATGTGACATGGCAGCTTCAGAAACTT 

AGCCACTGCCAAAAAAAGAGCAGGCAGGGATAATGTTGTCCCATTGTCCAGTCAGAGAGA 

CCTGTTGAGTCTCTAGTTTGCCAGTCCCCMGAGACCTTTGGAGTTTTGCTGGAGCCAGA 

CATCCTGCTTAGAGATGAGGAAGATCCTGCTGTTCCGTGGGGAGCTCTTGAGACACCCGT 

GCCACCACCCACCTTCTCCTGATTGCCACTTGCTGCCCTTTTCCCATTACCCTCTCCTGA 

CTCCATMACATCTTCAAGTCTTCCCTTTCTCCACCCCAAAAAATGCCCACCTTGGAAAG 

GG 

Sequence 809 

AAATTMTTGGGGTTGNGCTMCTGCCCGGTTTTCAATCNNGNAAACCTTGTGGGGCCCA 
NNT GMTT MNANMTNGGNCCACCCCCCCGGGGAAAAGGGNGGTTTTNNMNTTTTGGG 
GCCTTTTTCCCCI I I I I IAAAAAA 
Sequence 810 
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CCGCGGTGGCGGCCGCCCGGNCACGGTACGCGGGGATGTCTTCTGAGAGAGTCAGGGCAG 

CTGAAGACTGGGTGAGGGTGAGGGAAGCCGCTGGTGTCCTCCTCAGTCACCCGTGAGAGG 

ACTCCTNTGTGGAGCTAATCAACTGCAAGGAAGATTGTTCCCAGTGTCCAGACCTGAAGG 

AGTCTGGACCCATAGTGCANTGAGATTTGGGGAAGGAAGGATTCCGGATAGGGGTGAGCT 

TTNTGNTGATAAGCAAATGTGAAC 

Sequence 81 1 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGTTTATGGGAAGCCAGTAACACTG 

TGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTA 

GAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGAT 

CCGCCTGCTGTTGMGCCCCCTTNTNATTCCGATCGCTTTTTGGCCTTGATGATTTGAAA 

ATA 

Sequence 812 

CCNTCAGGTACCAGANCTTAGCAGGGATTTTGGACAACAAAAGCTCTAAATCCTCTTGCA 

TCGACACGTTCAATTTGCACTGACCAATCTGTTGGCACAGTAACTGTTTATAAGCTAAAT 

TTCTACATTTTGGCTACAAGTATCCCAMTCCACCTTTTAAAAAATCCTAGGTAGATGCC 

ATCT GGTGTTMTGATTTGCACACCCCTTAMTTGAAAATATTTTAAATAAATCTCACGG 

TTTTATATAGTATCATTAATGTGTCTATTTTAAAAAGACAATCTGAGAATAACACTTCCC 

CTMTT GTTGT CTTAATAATGACCAAGAGCTGAGGAAAAATGATTCACACTGTTAGTTGT 

TTTGTTGTTTTGCTCACGGGGGMGGGGGTGAAGTACTGGCTGTGCCTGGGTTTG 

Sequence 813 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCC 

CAGCCAGGCACCACCAATTACCAGAGGAACAAA 

Sequence 814 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 

GAGCAGTTATCCMCAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 

TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AMTCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATCACGTCTGAGATGGGAAAAAGGCTCTTCAGCTACTCC 

CAAAAGGGTATGCCCTGGGCATGGG 

Sequence 815 

CCGCGGTGGCGGCCGAGGTACTC I rTTTTTTTTTTTTTTTTI I G AGACAGGGTTTCTGTT 

GCCCAGGCTGGAGTGCAGTGGCACAATCTCAGCCCACTGCAGCCTCCGCCTCCCCAGTTC 

CAATAATTCTCATGCCTCAGCCTCCCAAAGTGGTGGGATTACAGGCGTGAGCCACTGCGC 

CCGGCCACCTTTCTATTTTTCTGGTTMCTTTCTAMTGTTTGAMTGGCTTCCAGTGAA 

TTTCATTTTATTATTGGGGGAMCTTCCATACTTATTTTCTTTCT^ 

AGTATACTCTCCTCCCAAMTTTAGATAGTTGTATTTTTCTGATTATTCCAAATAAGAGT 

GCTGAGAGGCTAATCACAAAGAGCAACAGCCAGAGATTTACAAAGTGGTTCTCTTACTAT 

TGAACATTTTCACTTAT 

Sequence 816 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCCCAGGTAAGATCAC 

TCGTGGGTMGAACATGAGGTTCTCACCCGTMGGCAGGATTTTTATAGAAGGAAGGTAG 

GTCTTTCAACCTATGTCCTCCTTCTGTTCCACAAAGTGGAAAGCCACAAGCCCTACAAAA 

GCCTTGCAAGTCCCAGAGGCTGCAGCCGTATTTATTCTTCAGGCCAAGACTCTCAGGACA 

GAGAGCACCCATGCACCCCGCAGGCTGCAGGCCATCTCCCTGCATTTGGGACTGTCCTGA 
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GGATGGCGGCTTCATTTTTGTCCCTCCTACCTCTGA 
Sequence 817 

GMCCTAGGGCGATTTGGAGCTACCCNCGGTGGCGGCCGAGGTACATTTTGGCAAACCGT 

GMGGGCTTTCNTTTTNGCAGGTTGGACTTCCCCCCCCTAGTNGGCAGG AI mTTTT AG 

GGGACCACCTGAGAAAGGTCTGTTACCGTGCATMACCTCCTTTMCACCTTTTAAAAAC 

TCTTCTGGGGGCCGGACTCAGTGGCTCATGCCTGTAATCCCACCACTTTGGGAGGCTGAG 

GCAGATGGATCACCTGAAGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCC 

GTCTCTACTAAAAATAGAAAAATTAGCCAGGAGTGGTGGCAGGTCCCTGTAATCCCAACT 

ACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGC 

Sequence 818 

GCCAGGAAACCCGTAAAAAGGGCCCNGTTTGTTGGCGG I I I I I I I CCATAAGGGTTTCCG 
CCCCCCTTGACCGAGGCANTTAACAAAAAATNGACNGCTTCAANGTCAGAAGGTGGGC 
Sequence 81 9 

CCCCCCGGTGGCGGCCCGCCGGGCAGGTACTGGGAAATGAGGCAAAAGTNTNTCTCTTCA 
CTGCTAGCTCCTTGGGGACCAGCAAGCGGCTCTCAAGTTGCGTGGTGGCCCAACTGGAAA 
AMGGCAGTTCGGTGCATCCTGGGAATATCCAGGTGAAAGTGTGAGATTTACCTAGAATA 
GCTTCTGGGCCTCTGGGGTTTTCACGCTGTCTCTGGTGAAGGTGTCCATTTTAGAACTGA 
AGCAAAAAGGTTTCMTCCGTTCCGTTTTCTTTGTTTTAGCACTTACCCCAGNNNCCTCC 
ATAACAAAGGTGGNGCCCCTTCAGGGAAATTAA A I M M I I I I CNTAAAGGCCTTGGCAT 
TAANCCN I VTTm I GNGGGNNGGGNAAAN I mTTTTTTI 
Sequence 820 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACAT 

CCAGGNCAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACGCATTCCGCTTCTG 

TCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTC 

CAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATT 

CCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATC 

AGTTTATCAACCAACAAGCAGTTCCAGCACCCAGCACTTTTTACCTGAATTTtAC 

Sequence 821 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCATGCAAAACTCCAGATTCCTATCTTC 

TTTGGGGGAAAAGCAAATTGGAAGCTCTGACAATGCTGGGCTTTACTTTCCCACATAGCA 

ACCATCAGTTGGAGCTGAGACACCTCTGCTCTCTTTAGAAAGAATTATTAATGCTTCAGT 

CTCCATTATTGCTTCCCTMCAGTGAGGATMGTTATTGGCATCAANCCTGGCCGGTTTA 

NCTTGGGGGTTTATTTTNTNNNTTTGGGGCCTNAAAACCCCGGGGGGNNCC I I 1 1 I GGCN 

CNGNGGGGGGGGGGAANTNTNNNANNANGGNNGGGGGGGGGTTTNTCTCNNCCCCCCCCA 

Sequence 822 

CCGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGTGTTTATGGGA 

AGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCG 

GGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCA 

TGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCGC l I 1 1 I GGCC 

TTGATGATTTGAAAATAAGTCCTGTTGCACCANATGCAGATGCTGNTGCTGCACAGANCC 

TGTCACTGCTGCCATTGAAGTTTTTTTCCMTCATCGTCATTGGGATCATTGCATTGATA 

TTAGCACTGGCCATTGGTCTGGGCATNCACTTTCGACTGCTCAGGGAAGTACCTCGGCCG 

CT 

Sequence 823 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGTT 

AGCTATTATCATCACCCTCCTTGCTAGGCAGAGCAGGACAGTGGGGAATTGATGTTTCCT 

CCCCTCCATCTCACAGGTGGGGCAGGGGTGTGCTGAGAAGAGAACTTGGGACTCTTGGCC 

CCTGTTCM1TCTCTGCTTAACCTGCTAGGCAATTTGGGCCTCTGAAAATTCAGTAATCC 

TCATAGCAACTTAGACGTCACCTGGGCCTGTGGTCCCCTTCCTAGCCTAGGAGTCAGAGC 

ATGMGCTCCATCTGTCACATTGGTTTGTTCAGAGAACTACACATGCGTTTTATTTTAGC 
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AGCATACAGGTTCCCACTTAGGCATTGAGAGGACATAGGAAGCTGTTTAACTTCCTA 
Sequence 824 

ATCACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTTATTTTA 

AATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGG 

AGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGA 

CTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTC 

ATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGT 

TGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAA 

TACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAA 

TCACCCTTGAACCCAGGAGGCAGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAG 

CCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAGAAA 

Sequence 825 

CCGCGGTGGCGGCCGAGGTACAGATGTATGGATCTCATAGCATTGAGGGGTCTTTCAGAT 

TATGTTTTCAAACCCCTCACTTTCTCTTTTCAGATAAGACCACAGCGACCTGGGAAAGTG 

CAACGTCTTAGCCAAAGACACAGAACTATTTAGCGACACTGTCTAGACTCTAGTTTCCAT 

GTCTCCTGACTTCAGTCTAGTGTTCCACCCCTGCCGCCCACCCCTGCCCCATCCTCATTC 

CTCCTGTAGGAGAGGCCAGACCTTTGCCTGCTGCAGCTTGTGGCTCTTCTCCTGCCTTCA 

GTTNTTCCATTGCCTG 

Sequence 826 

GGGNNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCTGTCTGGGCAACACT 

GTCCCGNNGGGGCCCCCATGACCAAACTAACTCTGCTTCTACCCAGAAAGGGTGCAGAGT 

GGCCACTAGACTTTTATGTGGCAAATGGGATGGTTATGCCCAGCCTGAAGCCAAGATGCC 

CTTTCTGGTTGCCTTGATTTGTGTTTAACAGCTCCAAATGCTTAATGAGGCAGTAAGAGA 

CGTCTCTCTTGGGCAGTACCT 

Sequence 827 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGGTGTGGCTACTACCGTTACA 

ACTGCCTGTGCTTGGACATGGACCCTCTGCAATATGCGGCAGTTTCATTCATTGCCCCCT 

ACATTCTACACCAAGTAGAAATGGAAGGCAATTGGATACTTCACAGACAAGATCTAAGTG 

GAGAAGGAATGCGTCCTGTGGCTGCAGAGATCCTTGGAGCTTGGAGGGGAGAGCTTGAGC 

CCCACTGATGATGACCTCCCACAGCTCGCCAACTCAGCCCTCCCTAAGTCCCCATCGGGG 

GCCAATTCTCACTCTGGGGTTGGGGGGACTCCACCATAGCTCATCCATCATAGGGGATGT 

TGGTATCTACTGTGGGTTTGGGTAGGGCCCGATGTGCTGAGGATGGCTCCCCCACAAGCA 

AGAGATGTGGGATTTGG 

Sequence 828 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCANGTCTTTCNAGATANGACCCTGAATGCCTNAT 

TCCATTGGGCTGGGGCTTCCACCTACCAAGTTGGGTGGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCMCCMCAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTTA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCCGGCCCCCCCC 

Sequence 829 

CGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCTGATCTACTCCTCTCTACAACAAC 

CTTGTGGGTGACGTTATTATCTCCATTTCACAAATGAGGCCACAGAGGTTCTAAAGGGTA 

AATGACGATGATGATGAGAGGTAAGTGATAAAACAATGTCTCCTGACCACAAATCCTGGA 

ATTTAAACATAAGNGTAGTAAACATGAACTCTAG GAAG CCTCCTGGGGCTTCTNCCTGTG 

TCTGGAGCCCCTGCACATGCCCAAAGGAAGTCCTTTTGGTTCTNCGNTCAGNAGAGAAAG 

GGNGCATTTCACTAAAAGGGAGGTGGGGAAACAAGACTGGTGGTAGGG 

Sequence 830 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 
GAGCAGTTATCCAACAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 
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, TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 
ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 
TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 
AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 
GAGAGTAATGACATCGACTCCATTCACGTCTGAGATGGAAAAGGCTCTCAGCTACTCCCA 
AAAGGTATGCCCTGGGCATGG 
Sequence 831 

CCGCGGTGGCGGCCGAGGTACGCGGGTAACAGGAGTCTTTGCTGAGTGATCATCTGTTTA 

TTCTTTTACTCCACAAATATCGAATGTTTACAGCGTGCCTGGCACTGAGCAGGGCTGGGG 

TTTCCTGACCATATGGACCTTCCTGGGTATATCTGTGGGGCTTGAATGGTGTGTGACCTT 

GTGTCCTGCCCG 

Sequence 832 

CGGGCAGGTNCGCGGGGGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTC 

CGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCC 

GGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGC 

CCCCTTCTCATTCCGATCGCT1TTTGGCCTTGATGATTTGAAAATMGTCCTGTTGCACX) 

AGATGCAGATGCTGTTGCTTGCACAGATCCTGTCACTGCTGCCATTGAMGTTTTTTNCA 

ATCATCGNCATTGGGATCATTGCATTGGATATTAACCCCTGGNCAATNGGCTTGGGCATT 

CAATTTGACTTGNTAAGGGAAGTNCCTCGGCCGNTNTANAACTAGNGGGATCCCCCGGCT 

GGANGMTTTCAATTTNAACTTATTGATACCGTCCANCCTTGNGGGGGG 

Sequence 833 

ACCGCNGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACTTTTNNGGACAAG 

GTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACC 

AACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGAC 

CCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTG 

GGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTTATCA 

AC 

Sequence 834 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACCTTACCACCC 

CATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGCTCAATGTAAACGTGTG 

CACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTTCTCTGCACCCCTCCCT 

CCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGCTGGGGCGCTGGTCTGA 

TTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCCAAAAGGTTGGGACAAA 

CACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCATTTGTGGGGATAAAGGAGTGA 

ATG 

Sequence 835 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATAACTTATCG 

GCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGGAGCGGGAAGTGCAATCCTACC 

TTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGACTACCACACTTCGCTAAGGTC 

ACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTCATGTCTATAATCCCAGCACTT 

TGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGTTGGAGATCAGCCCGGACAACA 

TAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAATACGAGGCATGGTGGTGCATG 

CCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAATCACCCTTGAACCCAGGAGGC 

AGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAGCCTGGGGTGACAGAGCGAGAC 

CTTTGTTTCAAAAAAAAAAGAAG 

Sequence 836 

GNGGNGGCGGCCGAGGTACTTTAACANGCCATACTCCAGTCCCAACAATGTTAAATGCCA 

AAGCAGTGTTGGTAAAAGCCTCAAATGGTGAAAAGGACAGAAACTCAAACCCGCCCTTGT 

GCCAGTAAGTAACTGTTACTTATCTCACAAAGCGCTTGGCTCTGGAAACAATCTAACTCT 

GAGCTGCACGTGGAGTCTACATGGGAATGTGCAAAGCATGTATTTCCTTTTAGGTGCAGC 

AGAGGTAACCGAAATTCAGATAAGAGAAAAAAAATCCAGATTTCAATGCAAGAGGTGGAA 
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, GATCCACGMGATACTCGTTACTATTTGGTTTCTAGGAGCAGGATTGCCACTAGATATGA 
TGGAGAACAAAAATGAAGAGGTGTTGTGTAACAAAACAAAACAAAACAAAACAAAAAAGT 
AGAMGAAAGAGCAACAGGCCGGGCCGCAGTANCTTCATGCCTGTMTCCCAGCACTTTT 
GGGGAGGCCCAG 
Sequence 837 

NTTGCGTTGCGCTNACTGNCCCGCTTTCCAGTNCGGNGTAAACCTGTCGTGCCCAGCCTG 
CATTAATGAAATCGGCNCAACGCGCGGGTGAGAGGCCGGTTTGCGTATTTGGGCCGCTCT 
TCCCGCTTTCTTCGCTCACCTGACTCCGCTGCGCCTCGGGTCGT 
Sequence 838 

CGCGGTGGCGGCCGAGGTACTAGGCACTGAAGATACAATAAGCAATCCAGNAATNCCTCT 

TTGMG MTTTATTCTG ATGAAATAGAGACAAGCCTATTAAAGTATTCAAGGCAACATTA 

CTTAAATATTTTATTTTATTTTATTTATTTTTA^ 

CTGAAGTG 

Sequence 839 

AGGTACCTGTAGTCCCANTTACTTTGGAGGCTAAGGTGGGAGGATGGCTTGTGCCCAGGA 

GGCGGAGGTCACAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGCAATAGAGCCA 

GGCCTTGTCTCATAAATGAAATATAAAATATAAAATAAAATAAAATAAAACGTTGTTGGC 

AAAGATGT ATTCAA AATAGTATGTAGMCTTACCTGGAAGGTMTTTTGCMTATTTACC 

AAMTATACTTTTTCATCTAGTAAGGTCTCCTTCTCAGAATCAGTCTTAAAGCCATTCAA 

ATACAGATTTATGATTAMGATTTAAGTCCAAAMTGCTCATTATAGCAATATTTATAAT 

AGGAAAAATTGGGGAAAAACAATTATACATCCAACAGTAGTAAGAGTGTGACTACATTAT 

AGTATTAGTATGTMTGGGATTGTCACAGAGCMCACACATGTTTTTGAAGAATATTTAA 

GGGCGTGATAAATATTAMTGTAAATGTAMTTGAAAAAATGATATCTGTAGATTTTCAT 

TATGCATTTCTTTATGAAATTTTTNGATATACACAAMNAAMTAGTCATGCATTTGCTT 

CATGACGGGGACATATTTCTGAGAAATGTGCTGTTAGTCGGTTTC 

Sequence 840 

GGCGGCCGCCGGGCAGGNACANCTTCCGTGGGGGGGCNNAAAACCCCCCACCNAAANCAA 

AANAGCANCAAAGGAAGAATTNNTTAAGGGCAGGGGGGGGAAGCCCCNAAAAACCNGANG 

GCAAA NNCC AGAGCNGNGGNNNAGCNCNAGNNNCNGGGAAGAGCAANCANNACGCTTTTG 

TGAGGTTTTNGGNGNNGGAGGGGGGGAACCCCCCCGANAAAAAACCCAGNAGCCCCCCCA 

CNNAACAGGGGANGAANAAGAGGAAGGNAAAGGANNNAANNAAGAGCCACAGNCCNGGAA 

ACANGAAANCANNANNNCAGACANGCCNNNGAANANGNNGCCNACNNNAAAAGAAGGNNN 

GGNGCGCCANNGGGGAGNAAANNGGAANNANNAAAAAAAAAAAGGAAAANGNGAANGAAN 

GAGGANAANANNNGGGGGAGGGNAAANGGGGCGGGGGNNNAANGCCNAANAANANNNAGN 

NNNNGGGGGAGGGCCNG 

Sequence 841 

AGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGC 

TCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTT 

CTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGC 

TGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCC 

AAMGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGMTTTCAGTGATTCATTTGT 

GGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCG 

Sequence 842 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTACTCCCAAAAGGCTATGAAATN 

GGGGAAAACCCCAGGTGATTCATGCCTGCTTAGCTGCAGNATNTCAGTNGCANTAGGTGG 

AACCCCAAACCCAGNGCANAGTGCCAGNGTCTGCTTNGGTGAGATATGAGTGTCAAGTCT 

CGAACCAAGCAACCTATCNAAAGCCTGNGACACTCCTGGCCACAGGCGGNTGGTANAGGC 

ATAGNANACTATTGCCCAGGTGACGTGACTTCACAGATGCTGGGAAGCCTGCTGCCCCAT 

CCMTACMTACTGCCCACTGTGCATAGAAACCAGATTCCAMGTTAGAGCTTCGTTTTG 

GCCATGAGTGCMTTTCACTGCAATGTTTTATCTTACTCAACTGCCAGGGTCAATTTAGG 

TGGTAGGGCTAAATCTCCTTCTTTATATTGGTCCMATGATTTTTCCTGATGCTGCATTC 
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, CCGGA 
Sequence 843 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCTTAGACCAGGAACACAGGGAGGTAGAG 

GGCAGCAGAGCAGGGACTGGCTTCAGAGCCAGACAGGTGGCTATGTGACTTAATGTGTCT 

GAACCCTGGTATCCTAGTCTATTAAATGGTATAACAGCAGCTTCTAGTATGTAAGTTCCT 

TGTCGGGAGMMACTGTTTTGCTCATGGCTGGAGCCTTAGCATGTTGCATCATATTGAA 

CATGTMTAGATGCTCAATAAATATATTTTTAAGAATAAATAAATGTAAATGAAAATTAC 

TTCACAGTGTTTCTGTAGAGATTTTATAAGATATGGTATACACAATGCATAACATAGGAA 

CTGACGCTCAAAMTGCCAGTTACTTCCATCATTGNGTCATAGGCTTTTATGTTCATTAT 

CCTGCTGCATCATCCCAAAGAA 

Sequence 844 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGT 

GTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGNGGTGCCATCTACATTT 

TTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAA 

N TAGTC ACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCG 

CTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCANATGCTGTTGCT 

GCACAGATCCTGTCACTGCTGCCATTGAA 

Sequence 845 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTMCCCTMGGGATTCCTACAGCTTTTCT 

GCATGTTAAATAGTCTGTTTTAGCTTATTCTCTTATTACTTGTCTTGG I I I 1 1 ACTTTGA 

MGTTTGCTTMTMTCATGGGAATATTTTAGATTTTAAAATACAAAATATACAAGCTAA 

ACTTGAGAGCAGTTTTTAGTTGTAGAAACTGTTTCTTGMGTAATTGACTTAGCGTTTGC 

TCTGCCTCTTTCTTTTCTTACCTAGGTAGGTAGTGGGGACTCCTTCAATTATCTGAGCAA 

TTCAAATCTCAGAATGTAGTGTTGGGTAAATTGAGGGTT 

Sequence 846 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGNGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATbAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCA 

Sequence 847 

TGGAGCTCNCCGCGGTGGCGGCCGNGGTACTCCAAGCAGTCCCAAAGTGGGAGTNCTTAA 

AACACCATGGGCAGGTGAATGGCTGACCAGGTGGAGGTGCACAGTGCACCATGACAAGAG 

CAGTGGAAAATGGGTGAATCTGAGATGCCTGGAGGCGAGGGGGAAAGAGCACATCACAGA 

GGACAACGTCCANNGGGACACCCTTTTATA 

Sequence 848 

CCGCGGTGGCGGCTGTGGACTGAAGGGTGACTGGTTCCACTGTGGTCTCCATGGGAACAA 

GTTGTTTCTGGAGTCTTCCAAGGAGAATTTCTCACAGTGGACCTGATCTCTGGGCTGATG 

CTGGGTTCCTTGGAGCTCATGATTTTGAGAGTGGTAGACATTTCTGGGCTTCCTGGGGAT 

GTGCCTGCTGGACTGCTCCCCGTCTCCTCTGCTGGGGCAGGCCACGTGGAATTTCCTTGT 

GCTGCCTGGCTTGACATCTTTA 

Sequence 849 

CCGCGGTGGCGGCCGAGGTACCTGAAGAATCTCTCTTCAGCTCTCTTCTCCTGGAAACTT 

GAGTGGGGCAGGAGGAAAAGCGGAGCTAGGTGTCATTTTAATGAGGAACATACTTGTCTC 

CTCCATTTATCTGGCCCTCCCTGATGGCACTCCAGAATTCCAATCCCACACGATTAACAA 

CATAGTTTCCCTTTTCTGCTTGAAGGTCCATTCTCCTCTCMTTTCAAATCACCTGAGAT 

ACAAAGCTGCATTTCCCCACAAGAACCAGTTCCCTCTCCTTTCCTTCAGTGCTACTGTCC 

TTCTCTCAGACCACCAAGCTTAAAAACTCCAGAGGCTCAAACAGCAAAGATGGCAGCCCG 

CTCCTCCCTCTGGGGAGTTCTGGCCCAGGGAGTTTTCAAATTTCTGTAGGCGGAAGAATA 

CTAGCGGGGAGTGGCTGGAGACCCCAGTTGGTAGGGNTCCACATTTGGGGGAAGTGAGCC 
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, CAAGCI I II IN 
Sequence 850 

CCGGGCAGGTACATGAAAGTAAGATCACAACCACAGGAACCACACAAAATTCAAGGCACC 

AGAGGAGCCCAGACTTGGCTGGCMTGCCTGTTTTGGAGCTATTCCACATTTCTGGAAGT 

CAATGGGAATACCGGAATATGAAAACACTATGAGGCCGGGCACAGTGGCTCACGCCTGTA 

ATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGTTCGAGACTAGC 

CTGGCCAACATAGTGAAACCCCATCTTTAATAAAAATACAAAAAATTAGCCGGGCGTGGT 

GGGGGGTGCCTGTAATCCCAGCTACTCCGGCGGCTGAGGCAGGAGAATTGCTTGTACCTC 

GGC 

Sequence 851 

CCGGGCAGGTACTTTTTTCTTTTTTC I I 1 1 I I I I I I I I GAGTGGGGGCGGGGTTTCGCCA 

TGTTGGCCAGGCTGGTCTTGAATCTCGGGTGATCTGCCCGCCTCGGCCTCCCAGGGTGCT 

GGGATTGCAGGCGTGAGCCACCACGCCCGGCCTCGATATATTCTTACAGTGGAATACTGC 

TCAGAAATACTGATGAATCTTAAAAAACATGATGTTTAGCAAAAGAACCTTGGTATAAGG 

TTCTTGGTATAAGGGATACATAGTCTATGATTCCATTATATGAAATTCTAGAACAGGAAA 

AACTATAGTGAAAAACAATCAGATTAGTGGTATCTGGGGTAGAAAGTAGGAGGAGATTGA 

TT 

Sequence 852 

CNANAGGGCTTTTTGGGGGCAAAACCGCGGNGGCGGCCGCNCNAGAACNAGNGGANCCCT 

NTTGGGGGGGGAAAAAAACCCCAAGCCCACCGANACCGNCGACCNCGAGGGGGGTNCCGG 

NACCCAGNGNNNGNCCCCTAAANAGAGGGNNAANNGCGCGCNNGGCGNAANCANGGNCAN 

AGCNNGNNNCCNGNGNGAAANNGNANCCGCNCACAATTTNTCTTTtNTAGNCGAGCCGGG 

AGCAGAAAGCCCNAAGAAAAAAGN 

Sequence 853 

AGGTACCCACAGCCCTTTCTTTTGGAATTCCCTAGAAGGGGTCTGTGCCACATACAGGAA 

GTAGGGAGGGTGTCTTTGCAGCATATTTCTTCTCTTGGAGTTAACTGCGAACGTTGCACG 

GCGACCTCTTGATCCATTCTGTGAAAGCCCCAAGCCTGTCATGCAATAAAGACGTCCAGT 

TTCACCGCAGCAGGGAGGCCGCATGAMTATTCCACTTGAACAAAACCACTTAGCAGTTT 

ACATCAATGCTTACCCTGTCGCA7TGAAAGTGATGTGAACCCACACCCAAGAGCCCCCAA 

ACCAGCACGTTGATACCMGTTTCCCCAGCTGCATCCAAATCAATTCCTTCTT 

Sequence 854 

CNTTTGGGGGCTMNCCGCGGGGGCAGCANCNNCACCGGCAGCGCGCNATTCTTTGGGGG 

GGTGNAAAACCCCCAGACCCCCGAGCNCNNCCACTCGNCNCACCANTNTAGAGACTGGGG 

CCGNTCCNCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnn i linn iGGGGTTTTNAGNGAGGGGAANCG 

CNCGCAAAAAAANAATCNCGGCCANAGGGGGGCCNGGGNGAAANTGTTANCCGNGCACAA 

NNNCACACAANANACGAGCCGGGAGCAAAAAGAGGAAAAGCCNGG 

Sequence 855 

CCGGGCAGGTACGCGGGCTACACACACAGTTCGGATGCCAAGGGTGACACCCCATTCCCT 

TCACAAGAGGCGGTTCTGTCACAAATCAGCACTCCACCCCCACCACACCTCTCAGTGAAT 

GAAGTGCTGGTGGTCTCACTCCCCTGGTGACCCTTAGCCGTGGGATGGGGTGGTTACACT 

AAGGCTTCAAGCTGAGAATGGCCATCATGGCGGGAGGCTGTTTGCAAAGGCACCTTCTGT 

CATCCTGGGGTTGGCTAAGTCAACTCCACCCCTTCCCAAAAAAAAAAAAAAAAAGTACCT 

Sequence 856 

GCCAGGATTCAAACCAGGGANTTTGCTCCAGCACTCCGGCTCTTAACCTCAACCGTCTGC 

CTCTCCACAAACACCAGGATCAACCACCAAGACCAAAAAAACAGTCTCACAAACCATCAA 

ACATTGCACTTGGTGGCTCAGGACCTTAGCTTCGTCTTAMGGTCCCTGTTATGCTTTTT 

CTTTTTGCCCCAGTGTGGAGTGGTCTTCGTGTTTGTGAGTGCAGGGGTCAGGGGTTGTGT 

CTTTTCTTCTTGTNCCCTTCCAAGAGGTGACATGTATCCTTGATACTGGAAGGGCCCTT 

Sequence 857 
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AGGTACGCGGGCACTCCAGCCTAGGCAACAGAGCCAGATTC I I 11 I I I 1 I I 1 II 1 I A AAA 

AGTCTTTTAAMAATTCTTTTATTGTGCTGATTTTATTGTGTCATGAAGTGTAATATCGC 

ATGTAGGTATGTGTCAAGTATACAGAGTGTCAGGCATACGGTGTTCAGTCATAAGCAGTT 

CTGGCCTTTGGCCCTGCACTG7TTGTTGGCTTTTTAGGTAGGMCCTTCTTAGAGTAAGA 

CTGTCATGCTAAAATTGTAGCAATCAAATGTGCCCCCCATACAACTCATTTGAGGTTGAG 

ATTATGTTGCTAGAGTGGAGGAGATTGGAGTGTTCTAAATGCTMCAGTTTGTCTTGCCT 

Sequence 858 

GGCGGCCGCCCGAGAACGAGGGNAGCCTTTGGGNGGGGGGAAAAAACCCAGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGNACCCANCGCAGAGNCCCCAAAAGCGAGG 

GGCAAGNGCGCNCAGGGCGGAAANAGGGCNACAGCCGGGGCCGGAGCNAAACAGGCACCC 

GCCCACANTTNNTTACAACANACNANCCGGGAGCATAAAGCCGCAAANCCAGNGGAGCCA 

AAGGAGGGAGCGAACCCACAG 

Sequence 859 

GGCAGGTACAACCTCCATCCCCTGGGCTCAAGCGATCCTCCCATTTCAGCTCCCCCTGTA 

GCT GGGA CTACAGACACACACCACCGTGGCTGGCTMTTTTTGTATTTTTTGTAGAGGCA 

GGGTTTTGCCATGTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCCTGCC 

TCGGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCATGCCTGGCTGGGACATTAT 

TCAMTTGAAGTGAGGACATGATGTTAAAMGTTCTGGGCAAGTATTTTACAAGTTAAAA 

TACAGATGTAAGACTTGACTTGATCAAATGCCCAGCTCTGTAATTTCACCTAAATTG 

Sequence 860 

AGGTACMCATTGTGAAMTTTTCCTATCATTTCCTCTAAAACTCTGCAGGCATGGAGGC 

TGTCTGCTGAGATATAGCAGGGAACAATTTAACCCTTTGTTTTGCACCCACAAGATGAGC 

ATTACCAAATTCCCAACAGAGACATGCTGGGGCTGTGCTGTGCCTCCACCCTGCCCCCTC 

CACAGCCAGCTCCGCGTGTCTCAGTCTGTAACTGATAGCATTCCACTTCCTGCCTGTACC 

TCCCCGGC 

Sequence 861 

CNGCGGCGGCCGAGGNACACNNANAANAACCTTTTAAGGGGNGGAAAAACCCAANCCCCC 

CGNCCNACCNCAGNGACNNGANGNATNTACNGGAAACAGGGCGCAGCCCNAGGAAGGACA 

GNNGAAGNCCNNACNGNGCAAGNCNAAANNNNAAGGAAAANNAGNCCCGCGANGAGNNNC 

CNCANGCTTTNNNTNTGCGGGGACCAGNCAGGGGNCGNCCCCGACANAANCAGGGCNCGC 

CGNCGGNGGNAGAGNCACNNNGCAGGGGNNGGNGAAGCNGCNCCCANCCANGGACCNNGG 

CCGCNCNAGAACNAGNGGANCCCCCGGGCNGCAGGAANNCGANANCAAGCNGANCGANAC 

CGNCGACCNNGAGGGGGGGC 

Sequence 862 

AGGTACI I I I I I I I I I I I I 1 1 I I n I I I I GGNAGAGATGGGGCTTTACCATGTTGGCCAG 

GCTGATCTCCAGCTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAATTGCTGGG 

ATTACAGGCGTGAGCCACCGCACCCGGCCCAGTCTTCATATTTATAAATAAAGTCTTGTG 

GGMCACAGTCACATTTATTCATTTACATTTTGTAAATAGCTGCCTTCAAGAGGTAGAGT 

TGAMGTCTTTAAGACAMGATCAGGTGGCTTGCMAACCTAAAACGTCTGGTCTTTTAC 

CTAAMAGTTTGCCMCCCCAAAACMGMTATATTCGATTGTAATCTTTACAAAGGTGT 

TGGCTATATTTCAGTGTGT 

Sequence 863 

NCTATAGGGCNAATTGNAGCTCCCCGCGGTGGCGGCCGAGGACAAATTCAGTCCCAATAC 
TCAATACGTATTATAGATGACTATGAGTGCAAACCTTAGGATGNGATT^f^CTGAATAATN 
GNTCTTTGTAGGATTTGGTTACATTATTTAMATGAAAAAGATCTAGTTTTAGTGTGAGC 
TCAGTAATG NTAATNG GTTMGTTCATTGCGMTCTTG AGTTTTAG ATAAGTAGTTATTT 
TTTTCMTATCACTTCTGTTTTTAGTGATATTATATCAAGAAACAACGTATTCAAGAACC 
ATGGCTGACAGTGCCAGATATACTTAGGGATAAACATCAAAATGCAATTATAGTTGCTAT 
MCGTTAGATACTCGGAATCAAAATTTATTTGCANGCTGACTTGATAAACTAAATGAA 
Sequence 864 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTCAGCAG 

AMTGTGTGCTTTAAGCAGAGTCACAGGGGCCTGGGGCTGAACTGAGTCATTTCTCAAAG 

ATATCCCTGCCTGGGATGATGATGGCTCTAATTGAAGCTCTGGCATCATCTGGGGCTTTA 

TGAGCCAAGGGAGATAAGAAGAGCCACAGCAAAACCCTTGGGTCTACAGTGCAGGCTGCA 

ACCMGGCAGCATTTGCTAGAATATTTGTGATTATGTGTTCAACCTACAACCT 

Sequence 865 

CTA CTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTAT 

ACC I I I M ATTGTTGTTATMTTATTATGGGGTATTTCTAATTAATATGATGTTGAAACC 

TGTTTGGCACCTTCTGGAAGCTACCAAAAAAATGACACTCCATTGAAGTGCTTAAAAGCT 

GTTCTCATAAGAATTCTACTGGCCTATTGTAAAAAAAGAAAAAAAAAAAAGAAAAAGAAG 

MAGACACAAAGAAAATMTCTAMCACCAAAAACTAMCACAATTCCMTCCTTmCT 

GTACCT 

Sequence 866 

ACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGAAATCTGGACAGTG 

CTGCACAGATTGATACATTAGCCTTTGCTTTTTCTCTTTCCGGATAACCTTGTAACATAT 

TGAMCCTTTTAAGGATGCCAAGAATGCATTATTCCACAAAAAAACAGCAGACCAACATA 

TAGAGTGTTTAMATAGCATTTCTGGGCAAATTCAAACTCTTGTGGTTCTAGGACTCACA 

TCTGTTTCAGTTTTTCCTCAGTTGTATATTGACCAGTGTTCTTTATTGCAAAAACATATA 

CCCGATTTAGCAGTGTCAGCGTATTTTTTCTTCTCATCCTGGAGCGTATTCAAGATCTTC 

CCMTACAAGAAMTTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCA 

AMTAAGTCATTATGGGCTCAMTAGAAAGMGACTTTTAAGTT 

Sequence 867 

CCGCGGTGGCGGCCGAGGTACATAACATGATATCAAGGAAATGCTTGAAACAAACTTTCA 

CAATAAAGTCAGAAAAAAACTGTAAAAATTGTCTGCAATCCAAGAAAAAGCACGTGCCCT 

GTGTGTAGGGGGAAAGAGGGAMGCACTTGCAGTGTGACTTTATGTGGTCTTTCCCCAAG 

TATTGCTACGTTTTGACCTTTGGCCCMCTGMCAGGTGAMTGCCCTTCACATMGTTT 

CAATCCCCAAGAAACTAGCTGGMTGCAGGGGACTGTAGACACACTCCTGGACCAAATGG 

CATCGACTCTCAGAATCCAAAATGGGCCCTGCCCTCATTCTGAGCTTACGGCCCCAAGCA 

TATTCTAAACAAAGC I I II I IAA 

Sequence 868 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGATATCAC 

CTGMTTATTAATGMTGCCCAGGAAGTMTTTTCTTCTCATTCTTCTAAAACTACTGCC 

TTTCAAAGNGCACACACACCGCGTNCACATACACTGCATTCGTTGCTCCAGTATAAATTA 

CATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATGAAGACACC 

ANAGTGTATGCATACAAATCTCACTGTATTAAAGATGCAGGTTTTCTAATTGTACCT 

Sequence 869 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCGACGCGCGGGACAAAG 

GGMGCGAAGCCGGAGCTGCGGGCGCTTTTTCTGCCCGCGGTGTCTCAGATTCATTCTTA 

AGGAACTGAGAACTTAATCTTCCAAAATGTCAAAAAGACCATCTTATGCCCCACCTNCCA 

CCCCAGCTCCTGCANCACAAATGCCCAGCACACCANGGTTTGTGGGATACAATCCATACA 

GTCATNTNGCCTACAACAACTACAGGCTGGGAGGGAACCCGGGCACCAACAGCCGGGTCA 

CGGCATCCTCTGGTATCACGATTCCAAAACCCCCAAAGCCACCAGATAAGCCGCTGATGC 

CCTACATGAGGTACCT 

Sequence 870 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGG 
ACCCGGAAGTTTAAATTCCGCTCCCTCCACGAAAGAGTTGTAGTGAGTGAAAATAAATAT 
TAAAACACACGGAAATGTATTTTCCTGGCTGCAGCACCNGCCATCTTGCCTCGGNAGGAC 
TCATTTTNAAAAACAGCAGCTTCTTGAAGCCCCANAACGCATTCCTGTGCTACGG 
Sequence 871 

CCGGGCACGGTACAGAGCCCAAGACAAAAGATAGGCCTGTGAGGATAACATCTGGTATAT 
CTGACCCTTCCCAGCATGGCCAGGAGGCACAGCCAGGCCAGGGAGGGCATACTGGGTTTG 
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GCTTTGCCCTGCAGCTGTTGGCCTAGGTGCTGCGGTCATACATATGCCCTNAGGCCTTTC 
CATGGCTACCTACCTAGAACCCAGATTC I I 1 1 1 II I I I I I I GAGACGGAGTCTCGCTCTG 

TCCCCCANGCTGGAGTGCAGNGGCGCCATNTNAGCTNACTTGCAAGCTNCGNCTTCCGGG 
T 

Sequence 872 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTCCGGGTCATCCGCA 

GAAATTCCTCATAGATGGTAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACA 

GGGAGTCCACCCCGGTGTGGTGCCTGCTTGNGGACAGACCTGAATGTTGAAACTTGACAG 

TCA GAAAM TAACTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATA 

TTCCTTTTGTTCCTCTGGNAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGT 

AGGTTGGTGATGGTAGAAATTCAGGTACGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTT 

GATGAACTGATGACTCC 

Sequence 873 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCGCCCGGGCAGGTACAATGCTCACT 

GGGAACCAAAGTCAGGCATGGGGCTGGGCTTTAAGGAGCACAAACAAAAAGGAGGGACTA 

GAAAACTTCAGAAAGGTATTGGTGGGGGATGTTGCGGGGGGACAGGGGACAGCGAGGATG 

TGGGATCCCGAGATCGTCCAAATCCCTATGTGTAGACATATGTGTATAAAGGCCTTTAAG 

AGACTCAGGCTGATGGGGTATCTGTAATAAATCAAACATAATATAACAGCACGTCAAGTG 

ATAAGGGGACTCTGGAAAAACAAGCAGCAAAAGGAGCAGTATCAAACTCCACAGAAATTC 

ACAAACATCAAGACACCAAGAAAGCTGCATTNATTTAAATCAAGGTGACAGGCTGGGCTC 

TGTAGCTCCAGCCTGTAATCCTAGCACTTTGGGAGGC 

Sequence 874 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI mTTTT 

I I II I I I I I I I I I I I I I I IGGGATGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAATG 

GCACAATCTTGGCTCACTGTAACCTACACCTCCCGGGTTAAAGAGATTCTTCTGCCTCAN 

CCTNCT GAGAAGCTGGGACTACCAGGGGATCCCGCCCCACCCCGGGTAGGTTTTTTGTAT 

TTTTTTAGNNAGAAGACAGGGTTTTCCNCCCATATTGGGCCCAGGGCTTAGGTCTCGGAA 

CCTCCTGGACCCTTGNGGATCCTGCCCACCCTTGGGCCTNCCAAAAATGCTGGGGATTAT 

MGGNGGGGAGCCACTGTGCCCGGGCCMCAATAMTTTTTTTAAGGGTTAGTCAAACCT 

AACAACAAAANTTTAAAAGGTCAATCAGTAGTTCTAAC I I 1 1 I I I I I I I 

Sequence 875 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGACTTTGTAA 

ATGTGATTCAGGGCCCCCAGCACCCCTGTGTCTGCAGAGTGCCTTCAAAACTCAGCTGTT 

CCGGCCGGTGCCAACCTGTGAACTTCCCACCATATCCCAGAATCTGCTATTCCCCAAACC 

ACTTCCCAGTTTCCTTTCAGTAATCTTTCTGAAGGAGCCAGGACAATAGGGCCTGTTGTT 

TAGTGMTTTCTTTATTATTTTCAGCCTTTMAATGTAATTTCCATCTCTTGCAATG 

TTGTT TCCC I MUM I GCTTCATTTTGTTTAMTTTTCAGGTATTTAGCTCCCCTTTCA 

TATTATTTTTAMTTTTTTAATTACCTGTTGTAGGGGTG^ 

AAATTTTACTGTTGTGATGTACCT 

Sequence 876 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATTGA 

TCAAAAGCTTTGTAACCACAGGAAAAAATAAACTCTTCCATCCCTTAAAGAATAGAATAG 

TTTGTCCCTCTCATGGGMTTGGGCTGTATGTATATTGTTCTTCCTCCTTAGAATTTAGA 

GATACAAGAGTTCTACTTAGAACTTTTCATGGACACMTTTCCACMCCTTTCAGATGCT 

GATGTAGAGCTATTGGGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCT 

AGAGATAACTCGGGA 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCTTA 
AACTGCTCTGCTCTTTAAACCAAATACATACACACATACACAGATATAGTTAGATACAGA 
TGTGTGTGCATATAAAAATATGACACTCCTTAGTAAAATATTCCTCTAGACCTGGGGTTC 
ACACATCCCTCCTCCTGATCCGTGCTGGTGCCTACTCAGGCACTACTTTGCAGATTTCTC 
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TMTGMCCAMCCTGTTTTTCCATGGMCCAATMTMTTAATCTAGAGTGAGCCATTT 

GGCCTCCAGAAACAAAGAGATTTCCATCACAGAGTGTTGGTGAGGGGTCATGAGTAAGGC 

GGGGGGGCAGTGAGAGCAAGCTGTTTTATTGTGAGAGTAGCAGGCAGGCTGAATGAGAAG 

GGGTAGCTGTT 

Sequence 878 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCATTCC 

GGGTCATCCGCAGAAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCG 

AGAAGTTACACAGGGAGTCCACCCCGGTGTGGGTGCCTGTTGGGGACAGACCTGAATGTT 

GMACTTGACAGTCAGAAAAATMCTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGC 

GCATCCTCMTATTCCTTTTGTTCCTCTGGTMTTGGTGGTGCCTGGCTGGGCTTTGTCC 

TGGGAATATGGTAGGTTGGTGATGGTGAAATTCAGGTAGGAAGTGCCTGGGTGCCCCGCG 

TACCTCGGCCGCTCTAGAACTAGTG 

Sequence 879 

CTNCT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I I I I I I I II 1 I I I 

TTT7TGGGGAAATMCAGGGGAGAGCAMTTTCTAMAACTTGGGGTTTTATAGTMTTT 

CTGATTTTCATGTTTAGAAAAAGAAATCACATTAAAAATATGC 

ATAGGATACACTATAATATTATTGTAGTCCAGAAAATCTGTATACTATAATTCCTAGGGA 

AAMGAGAAMTTATTAGTGTCAAAATACCTATMTTCCCACAGTTACCATATACATTTT 

TAAAA ATTGT TTAAATACACAAACAATGATGATGCTGTCCTACTAGAAATGACAGGAGCN 

AGAGCTTTTACCTTTCTTTCAAAATGCCTTTMCCCCTTTCATTATTNCCM 

ATTTAAANATTC I II I I II I 

Sequence 880 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGGCGCAATCT 

CAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGATTGCAGGCATGTGCCACCATGCTCGGCTAATTTTTTGTATTTTTAGTAGAA 

ACGGGGTTTCGCCATGTTGGCCAGCTGGTCTCCAACTCCTAACCTCAGGTGATCCACCCG 

CCTCGGCCTCCCMAATGCATCTCTGGTCTTTAAATGCCCTTTGCTGTATATTCTATAAC 

ATCMGTCTCAGATCTGGTTTGACCTCAGTTGGCCTCTTAATAGTTTTCCCCTATGAACA 

TTCTGGTCTCCCAGTAAGCCTGTAAGCAGCTGAGACTGGGAAACCATCTCTTATATCCCA 

CATCGTCCCAtG 

Sequence 881 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCACACTGGTAAAGAGTGGCAAGGTAGC 

CTTTGTMCCAGATATATCTGATCTCAAMTCAATTTTCTTAATTTAACCCACGTCAGTC 

AGTCAAATGCTAAGGCTCTTCAAGCTACACTTGGTTCCTCCACCCTCTAAAAGGTGAGAA 

CTCMGAGAGCTGGGTTCTTTGGGACCTTATCATATTTTTCCCCTCCCTAGGCCTTGATT 

TCCCATTTGGAAMTAMTCAGTGAGGGCTTTCTAGTTAAAAATGCCAGTTGAAGCCAGG 

CTTGGTGGCATATACATGTAGTTCCAGTTACTCAGGAGGCTGAAGTGGGGAGGATCGCTT 

GAGCCCAGGAGTCCAGTCCAGGCAACATTGCAAGATCTCATCTCTAAAAACTAAAAAATG 

GACCAG 

Sequence 882 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAATGAATTGAC 

AAATGGAAGAAAATTTAAGGAGAGTAGCCTAGAGAACTCATTCTAGAAACTAAATAACTT 

MGTCMAMTTATTTCCTATATTGCCTCAAGCCCTGCAGATAGCTTTGCTATGTTTGTG 

TATTTGCACATTGCACTCCAGCCTGGGCGACAGAGACTCTGTCTCAAAAAAATAAATGGA 

ACAATCACAACAGAAACATTCCCTTATTCACTGAACATTTCAAACCCTGAAAATGTGTAA 

TGAGAMTGACAAATTTTTAAAAGTTTAATTACTAAAGAAGACAAAAAATGTCTATTATG 

AATAGACCAATTCTCAATTGGTAGAGGAACTTTGAACTGGAAAGGAACCCTAAAGAAATC 

TCCTGTCTACCCCTGTTATTACAGATTAGAACCCGAAAGTCCAG 

Sequence 883 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATAATTATAATGATTTCAGATAGAACATGCA 
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ATTAGCCTTTTGAMTCCMCTTCTGTGCAMATTTTAGTATCAGAAMTACGAGATTTG 

CAGGGGGAMCATCAGTAMCTACCATTMTGTCMTGCCCAGTTTTGACTTTTGTTAGC 

CTGACACTCCCAAACAGTTGTAGAATCCGATAGATGACTGATGGCAAAAGATTGTGAACA 

TGTGGAAGAAAATCAGTGGGATTCGGTGCTGATGAATAGGTTGCCTTCAGAGTATTATTG 

ACAGACAGCTTGTGGMCTMTTCTTTATTTTTGATGTTGTGGGAATTAACACATCAATG 

GTGGTTATGGGAACTACCAATGGGTTCCTACAAT 

Sequence 884 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATTCTAAATTATAAG 

AAMTATACATTTGCACTTATTMTATAGAAATTCATTTTGTGTATATTTAACATAGCTT 

TTAMCTATTTTACATTAGCTACTTTCATTATGGTTTCTTGAACTTCTGAAAAAAATTA 

AAATGTATTAMCTTATCAGTMCATAAAMCTTATTTTGTTTCACCTAACGAAT^CTGC 

GTTTGTAAAMTAAATTTAATATAGAATATATTTTTAMTTAA^ 

AGCTCTAAGAMGMGCAAATTATCACTGMCATATTTCTTATTATTTCTGGCTTTGAAT 
TMTACGTAACTTAAATTGGCTTAAATGATCCAGAATATTGGAGGAATATGATACTTTCA 
CATMTATACTATGAACCTGTTCATATAACTCTGGATTGGCTACCTAACCTTCTGNTTTA 
ATG 

Sequence 885 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCACACCCAG 

CCAGTCAATAACTGAGAAATCAAAATAAAATAATAATTTCAAAGAATTACATAAATACAG 

GGCCTTTTGAGATTTTTGGCMTTGTAMCAAAAACGAATGGTTTTTACMTTCAGTGTA 

ATTCTACGAATATTTATTTGGCACCCATGTTAGGCACTGAGGCTACACAGCAGTGAAATA 

GGCCTAGTTGTTCTCAACTAGAGAACATAGTTGGTTAATGTAGCTGCACTGAATTGTAAG 

CTGTT TAGAA GATMTATACCCTGAGGCTTTTTAAAGTATACTATTACTATAAGGAAGTA 

AAATTATTTTATACTTATAAATTTTGTTTTGGATTATTCAACT 

AATTTTATGGGCGGTTGGGGACAAGGAAGAGGTATAATGCTA I Willi I CCTTTTCTTT 
TTT 

Sequence 886 

CTACTATAGGGCA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA Ci I 1 1 I I 1 1 

I I II I I I I I 1 1 II IAAATTTCCACCGGTGCCGAGGCCTCAGTGGAGCCTGGCTGGCGGCT 

TGTTAGAGCCTGCAGCCTACCTGTCCTGCATAGGAATGAAGCCGGGAGGAGTTACATGAT 

ATGCCCTCGTTGCAGGCCGGGGACACAGCTACCGCATTCAGAGACCAGGAAACAGAGCAA 

AAGCTGTTCTCANAGTGCGGCTGAGCGAGGAGCTACAGGGGAATGGNGGGGGCCAAGCTG 

CATGGMGATTGTCCCATTMCCTGGCTTTTTACCAGGGTGGTCCTNTCCCTAACCCCTA 

AGAATCACACCCTGCATCCAAACGGCAGCAACCCCAAAA 

Sequence 887 

CNCTACTATA GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I 1 1 I I I II I I I I I 
mTTTTTTI I AMG CTGGGATATCTTACAGAGGAAGGAAAAATTMCCTTTTTTACTTT 
CTTTCTCACTTTTTAAATCAGCCAMGTCAAAGCCCGTTTGCCAACCTGCATGTCCATGC 
CTGTAAGCCCTTCTNTTGGCCAAGGAAGAAAGGAAGAAAGAAAAAAGAAACCCAGGGGCC 
TGTATCCCCTGATTAAACACAGCACAGCACTCCAGGCAGACATGCCCGGNGGCGGCTCCT 
TTGCACCATTGACCTCAGGCCAGACACCTCAGCGCCAACAATGGGACCTCGGCCTTCCGG 
CTAGGTTTGCCCCAGGCTGGGCAGGAAACCAGCTCGGCCGCTNTAGAACTAGGTG 
Sequence 888 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGACATTGTGGCTAGCCAACCACA 

TGGTCAGCCTCAAAGTTGAGAGGCTCAGTAACCCTCCTATCCCTAGAGAATTCCAAAGTG 

TGGATGTAATTTAACCTAGGAAAGCCATTGGTGACTATCTGTGATCCTCTGGAAAGTATG 

CTATGTTGGGGTATATCTTTGCATCCAAAGCCAGAGGGGAACCACAATGGCCTAGTAAAA 

CCGGTGGGTCCTCAAATGCCCACTTAAGCCTCTGGCCTNTTGGAANTTTGACCCATAGTG 

GGCCGTTCAGCTTGATTAGAGCCGGGGAAAGAAAGAAAATATTGNCAI 1 1 1 I I I INTTGA 

AAAAAAATTT 

Sequence 889 
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CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGATCTATGCTGC 

TATGGGTGGAGAATCGACATCCTTTGAAACTGGCCACAGGCAGAGCTAAGAGGATGACTA 

AAAGGTCCCTTGGGTGGGTGCTAATGAGCAGGGGCCCAGGAAAACCTCTGTCTTCCCGGA . 

GAGCCCTCTTGCATGAGTTTCGGCTTTGCCAAGATTCCAGGGACTTGAGGACAGCTATTG 

AGTTATGGTTACGTGACTGCCACATTGGGGCTTGGAGGCATCTGGCAGATGGTTGGGAAT 

GGGCTGGCACCACACTAATTAGGCCACGATGATCCAGTTTGACTCAGGGAAACCCAGAAG 

TCATAGTNCTCTTTGCAGMTGACACMGGATGTCMCATGCTTTGNTTGTGTACCTCGG 

CCCGCTCTAGAACTAAGTGGGATCC 

Sequence 890 

CCGCGGTGGCGGCCGAGGTGCATATATATATATACACACACATATATATATTTATGTATC 

TTTTAMCATATAATTACTCTCTTTMTTCATTTGGACTTTTCATCTAAACTTGCTCT 

TTGCACAGGTCTGTTAGGGTAAGATATGTTCCTACCTTGAGAAATGTTGTGAATATCTAG 

CGAAACACCAAACATCCTCAGCTGACTAATGTGGTATCAGACTTTCTGGTTGCAAGGTAG 

GGG TGAA TAAGGCAGGATGGGGTGCGGGGGTGGTGCTGGAAGAAGACATGGCATCAGGTT 

GGGTTTTGCAGGATACTGAAATTGTCTAGGGGCCTTGGCTGTGCAAAGAGCCTTTCCGTC 

Sequence 891 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACGCGGGATTTCT 

CAGATAGTTATGCGCAGCTCCAGGCACCAGATTCTGTGCTGGGTGCAGGCAGGACCTGGA 

GGGCGTCCTCAAGTGTTGATCTGCAGGGACTGTCTTGATCTTTCCAGCAGTGTCATTGTG 

GGCACGTGACCTGAGCTTTCTGAGCCTATTTCCGCATCTGTAAAGTGCTATCCACTTCCA 

CCTCCTGGGCTGTCGTGCAGATGTAGGAAGGAATTGCACTCACACACTCAGCATGAGACA 

GGCGCTCAGTAAAAGCCCGTCCAGGGGATATGAGATCAGTGAGGGATAGGAAAGCAAGGG 

TGGGTAGAAACAGCAAAACCCTTTCCCA 

Sequence 892 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAGCATTG 

GACACACAAAAATACAGGCAGCTTCTTCCCTCAAGGAGGTCACAGGTGGGTGTGTCCATA 

GCAAAGCTGGGAGGAAGTTGTATGAAGGAGCCTGAAGACAATGGGGAGCTAGGGGAAAGT 

TCTGAGTAGAAAGGAACATGTGGACAAAGGTTTGAAATGATGAAGACTGATTAGGAAGTT 

CATATTATGMGCATAATTCMGCTTTCTCTACGATGTTCAAATCCCATCTCTCCTACTT 

ACTAGANAGGTGACATTGGGCCAAGTTACTTATCTCCTCTGCTCCTGTTTATTTGTGTTC 

AAAAACAGGGACCTCTCTCACAGTGTGATTNTGAAGACTGGACAAGAAAATGGGAGGTTT 

TG 

Sequence 893 

CTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGTTTGGAAGTTT 

AGGCAAAAGTCATTTCTTCCCTATATTTTGTCATGCTTATCTCCTGTCTCTTTCTGTTTT 

ACAGATTAGCMTAMCTCCTTAMACCCAAMGGTTTGGGCTTCTGTTCCTTTCACTTG 

CAGTCAGACATGGAGTTAGTGGTAGAAGAAACAGAAGGGGTAACCTGCATGGTGACAGCT 

ACTGAGGGGATGGATAGGAAAGCAGGCTGAGTCCCTGGGGCCAGTGGTTACCAAAGCCAA 

GGAGAGGGCAAGGGGAGCCCAGTGGGCCTGG 

Sequence 894 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACACGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGGCACCACCAATTACCAG 

AGGAACAAAAGGAATATTGAGGGATGCGCTCAACCAACTNTT 

Sequence 895 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAGTAGTCTAAAACAA 
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GTGACTTTACTACTTATTCTTCTGCATGTCCTTACCAGCTTCTTACCTTCTTCAGGTTGA 

GCATGAGATCAGCTTCACAGGGGATGGGGTCCTTMGGGTTTTTTTCCATACTAGTTTCA 

GCCTTAACAATGAGTTTTCAACCCTTAAACATGAAAAATAAATAGTGCGAAAGAGGGGAG 

GATGGTAGAAATGCTTTAAMTTACCTTTTGTAAATTTTACTTTGTTTATGT^ 

TGCCTTGCTTATCAGGGAAGTCCTACAAACAAAGAACTCCACGGCTTCTTCAAGTCTTCC 

MGGGAACAGGGTCCCCCTGGTTCCTAAAMTCAATGGGMGTAGGTTTTTGGTAACCAT 

CTACTGGTCMNGGGNMCCATTTCTACCTGGCGGTTTATTACACCTTTGCTAGGCTTCT 

TTTTTCTTTTTCATTTTTAAAAMTMTTTTT 

Sequence 896 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTCATATCCCAGTTCTA 

GMTCAGTTCATTTTCTAAGGAGTCCTGGTTCCTTTTATTGGAAACCAAAATCTGGGCAC 

CAGGTGTGCTCCCATTCTAGTCGTTTTCTGACCACATAACTGCTAACAAAGATGCTTCAC 

TCTGGCTACACTGATGTGMCTTTGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAAT 

GAGCTCCTACATTACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAAATAT 

GCAMGAATTTTCAGTATGAATGAAATAAGACAAACCATCAGGCTTGCTGTATTGTAAAC 

CMCACAATATAGTTATAACAGATCTGTAGAAGGGATCCTTAGAATAAGAGAGGCATTTG 

TCGGGGGGTCATCAGGGAGAATACTGGATAGNATCTT 

Sequence 897 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 1 1 I 1 I I 1 1 1 1 I II 1 1 I 1 1 

TTTTTTTCTTC ATGGCT ACATCTGMCAGCTACTGAGGGATATATATGCCMCTTTGGGA 

GTTGCACAGCTTTTTTGAGGCCATTTTNTMNATGACTAGGGACTGTAATT^ 

TTGGAAT AG CCAC AAGTTGTTGTAG CC AAG GTTTG N G NG NGTTTTTAATAC AATTCTTAA 

MTTTTAGTAGGCTTCTCATCTGTANATAGATTTGAAGGGGNGGGGTTGCCCCTCCACAC 

CTGTGGGGTGTTTNTCGTAAGGNGGGACCAGAGACTTAGGA 

Sequence 898 

CCGCGGNGGCGGCCNAGGTACACCAAATGGATTACAAGCAGCATCCAGCAGAAGACAGAC 

CCCCCAACCCTGCCCACCAGGGCTCACACTCTACAAAACCCTGAGGGCCTAGAAATCTGT 

AAATGCATCGNCAAGCACTGGGGCTGATTTGCAGTAATTCTCTAAGCAAGGCAAACATGA 

TCTAGCTTTGAAGGCAGCATGAAGGCAGCGGGTTGGNGAGAACAATCTNTCCTTAAGAGA 

AGAAGAAACCTGGGGCGGANGGAGTTTTCCCCGG 

Sequence 899 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTTANGGTCTTGAGTTAATTGCTCTGTGGCT 

GTGGATTTTTATTTGATGTTCTGATCTCTTCCTTCCAGTTTGATAMTTAGTGTAGAAAG 

TGGAAGAAAAACATGCCGGCGCAGCCTCTGCGCTTTGTCAGGTTAACAGAATGGAGTCCT 

GCTCTGGCATCAGTCAGTGCTGTTGTCCGAACCCTCTGTGGCTCCTTCCTCCCTCCCTGG 

GGCCCAGAGCTGCAGACGCTAGAGGGGTA 

Sequence 900 

GCTNCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAATGTTAAACTGAGGGATGAGT 

GAAACAATATCAGGATTAATAAATAAACACATTCTTGAATTCCATCACTTAATAGAAGTG 

GCCATTTGAATGCTGGCAGGTNGGAAGAAAAGAGGAGGACAAAGAACCCCAAAAGTTTGG 

CATCATAACTACTGCCACAAG 

Sequence 901 

ATA GGGCN AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I II I I I I I I II I I 

TGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGAC 

CTTGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCG 

CCTGGCCTTTCCAGGGTATTCTTTTAACGTGGTCTTATTTGCCTTTTTGMTTT 

ATCTATCAGCATCATATACCACCACTGGAATATAAATTTGAAAGAGAGTCCTGCAGATTA 

TATACATGAATCTACTTAGGCCTMTAACCAAGCAGTCCTCAGTGGCAGATCAATGAAAA 

GTGAAACTAAAGGCAAGTGAAGGGTAGGAGAGATTGGCCAGT 

Sequence 902 

GGCAGGTACCCACCTCCTCGGTCACCCACAGAGCCACCAAAGATTCCATGTCCCAGAGCT 
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TCCCATAGCAGACCTGAAAAGTCCATGACCTGAGCTTTGGCCATGGTAGTGGAGTGGAAC 

AGGAAATAGTCCAGCAGAGGAGTGTGGGGGAAGGGGGCAGGAGAGGCACAAGAATAAGGG 

AGACCTGGACTCTGGCTTTTTGGGAAAAGGAACCAAGCTCATAGCAATTTGGCTGATNAC 

ACMTCAGA7TTTTCCAGGTTAAGCTTCCTTTCTG 

Sequence 903 

CCGCGGTGGCGGCCGAGGTACCATCTACTGMTGCCCAGTTTTGATCTATTTCTAAATGG 

AGCAAACCAATTCCATCTCCTAGAGCTGGAGACTGTATCCAGGCAGTGTGTGGACAGAAC 

GGACMTCTTTTCTGCCAAGGGCCTATTTGAGTGGAGCACCCCCACACGGGTTAGACGGG 

TCGGCACGGGGCTGGTGGGTGAGGAACTCAGGGGTCAGTGCAAGCTGCAGACCCTCATTT 

GGGGAACGCTCTCAGCACAATGCTCTTACAACTACAGGGTGCACTCCAAAATGGAGTTCA 

AGGAAAAAAGGCTAATGAGAMTAAAATCTGAAAAMTAACTTAAAMGTTTTGCT 

Sequence 904 

CCGGGCAGGTACGCGGGGGCCCTTTGGATACCTGCACTCCCCATCACCGCACTCCCCATC 

GTGGCACTTCCCTTGTTGCAGTTTTATGGAGTGTGCGTCTGGCTCCCCAACTAGACTTGA 

ACCGCTTGGGTGCATMCTCGGGACTTGACCATTTGCGTCTCCCTACGGCCAGCTCAGCC 

TCCGCACACAGGGACCTGCAGAGAGTGGATGTAGCCACTGCCCCAGCGTCCCTGGGCTCT 

GAAGAGAAGCCATTGCCCTTCAAGAGCCACCCTCATTTCCTGGGCACTGGTTTGGAAAAA 

ACGAAGAAAAAGAGACACCCAGCTCACCTCCA 

Sequence 905 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCNGGGCAACTCATTCATGATATTGGG 

AGAAAAGCAAAGCAAAAACTGCAACAAAATCTCAAACCCTTTCTGCAGCAGCAGATGGCA 

AACAGTGATCAGAGGAGMGGACCCTTCCAGCATTAGAAGATTTCCAAAGGCTGTTCCAG 

TAGGGGCTGTGGGCTTCTGGGAGCCCAGATGCCCCCTGATGGTATATTTGAGTTTGTGAG 

GTGGAGGCCAGGTGGCAAGANACTGCNNGCCAATGTCAATGAAAAGCCTGGGAGGAAAAA 

GAGATTTCTGGGA 

Sequence 906 

AGGTACTTTGCTMCCAGCATTTTGGCTGTGTTATTGGCAGTTTTCAMm 

TTGC CATC I I I 1 I GAGAGTGCATAGACATTTMTTTTAAGAMTTTATAGAATTGGACTT 

TTTTGTCTCTATACATTT GTAG GTCAGATGCACATTTGTTTCCTGTTTCATCTTTCTTT^ 

AGAGCAAMA TGTAM GTTTTGTATGTAGAGGATAATTGTATGATGATGATAAACTAATT 

AGGTATTACAGTTTTCTMCGACAGAMTTTGTAATAATTAGGTAACnTGGTTCATATTA 

AAATATTTGATACATAGGCCGGGCATGGTGACTCATGCCTGTAATCCCAGCACTTTGAGA 

GGC 

Sequence 907 

GGCAGGTACCACACCCATCTTACCCTCTTCCCTCTAGGTTCTGACATTCAGCTATCTTGG 
TGGGAGGCTGGGGAGCACTATTGGGGATGAGGGTAAGGTGGAGTTTTATAAAGCTCTCCA 
GGTGACTCAGAGACCACCCTC ATTCCACCTGGTCACAAATCCCTGAATGGGAAACAGGTA 
CI I I I I I I I I I I I I I I I I I 1 1 1 I I G C A AAG AATTGTAAA ATTT ATTGT AT AAGT ATTG C A 

GCTTTTCANAATGTCATCATTGCCACTMTGATTACTGATACACMCMGCAGTTTCTTC 

AGGCCTGTGGATTGGCATC 

Sequence 908 

AGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGTGCTAATATCAAT 

GCAATGATCCCAATGACGATGATTGGAi-.AAAACTTCAATGGCAGCAGTGACAGGATCTGT 

GCAGCAACAGCATCTGCATCTGGTGCMCAGGACTTATTTTCAAATCATCAAGGCCAAAA 

AGCGATCGGAATGAGMGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGGTGACT 

ATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCC 

AAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCCCATA 

Sequence 909 

AGGTACAGCAAATTAAACCCAATMCAGGAGGAGGAAAACACCTAATTAATATAAAAATT 
TCAAGGATATGTTAAACAAACAAACAATTACAAGAAACTCCTGTCAAGTTAAACAGAGAG 
AGAGAGAGACAGGGAGAACCACTAACAGGAAAGAGAAAAAGATATACAAGCTATCCCGCA 
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GAMTTAMAAGAGACTMGMTATTACAAACAACTTTATTTCATTTAGATGAAATGGAC 

AAATTTATTTTAAMCACMTTCGCCAAAATTGACAGAAGTGGMGTAGAAMTCTATTT 

CTGTATTTATTAAAGATACTGAATCTATAATTAAAAATATCTTCACACAGAAAATTACAG 

TTTCAAATG 

Sequence 910 

AGGTACGATCTGAAGAAATGAAAGGCATTGAACTTTGGTGGGTAAATTGGGTCTTTTCCA 

GCAAAGGTATAMTCCTTAAMGCCMGATCATATTGTTTGATTTCTCTGGGCTCTCTGC 

TGGATACAGTGCCAAGTCCATAACTGTATACCCCATGGACACTCTATGTTAAATGGAGAT 

TAATGTGTAAGAGGTG I I I I I I I I I I I GTTTTGTTTTG ! I I I 1 1 AATTTGGAAAAGAAGC 

TTAAAGACCACAATGGGTGTGGCATTGGCTCGACCCACAGATCTGCTTAGTCTCAGACAG 

GCACTTTGAACCAGTCTTTTAAAATTGCGTCACAACAAC 

Sequence 91 1 

AGGTACAGATCACTATGGCTTGTCTTTTCTCCTAACTAATGTAAAATTCCCAATAATTCA 
TAACTTGTATGAGGACAACAGTTGTGTGMTCTACCCTGGTCCTTCTGATNATTTTTAAT 
TTTTTNAI I I I I I I I I I I I IGGGGACAGAGTCGTGCTGTTATCGCCCGGGCTGGAGTGCA 
GTGGCATGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCCAGCAACTCTCCTGCCT 
CAGCTTCCCGAGTAGCTGGAATTACTGGTGCCCACTACCACACCCGGCTAAI I 1 1 I I GTA 
TTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGGACTC 
Sequence 912 

AGGTACAAATTGTCGTTTTTATTCCTCTTATTGGGATATCATTTTAAAMC 

TTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGT 

TGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTGACTTCTTGCTTCAGT 

CATCAGCCAGCAAGAGGGAACAAAACTGTTCTTTTGCATTTTGCCGCTGAGATATGGCAT 

TGCACTGCTTATA 

Sequence 913 

TGGCCAGNTCAAATNACAACCCCCCAACCCCCCCCCCCCCCCCCCACAAACAGACAAGGA 

CACAGNTCACCANACAATGGATGTNCAGGNANTNGATATCAGCAGATATNTTTAGNCCTNT 

AGATAGGCTMTTTNANTNAGCAAAGGAAAGAGGAGGTANCATTAGNCAGATGGGNTATT 

NACCTCTGAATTAGATGGCACTTACCCANCTTCTGGNACAGNCCTGCTGGNGGCGTCTAG 

ACTAGTGATCCCGGCTGANGATCGATTAACTATCATCCGCGACCTCAGGGGGGGCCGGAC 

CCACTTTTGTCCTTA 

Sequence 914 

CGAGGTACGCGGGACACTGGTGGGGGAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCG 

CAGGGCCTCTACGGACCTTACTAGAAAAATGAAACCTGATGAAACTCCTATGTTTGACCC 

MGTCTACTCAAAGAAGTGGACTGGAGTCAGMTACAGCTACATTTTCTCCAGCCATTTC 

CCCMCACATCCTGGAGMGGCTTGGTTTTGAGGCTTCATGCGAGAAAGGGGAATGGGGA 

ATGGCTGCTTAACGGCATGTC I 1 1 I I I I I I I I I GAGACGGAGTCTTGCTCTGT 

Sequence 915 

CGCCCGGGCAGGTACGCGGGGACTTGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTT 
TCTCCCCGCTTGCCGGGGTGGTCCTCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTG 
TGGAGAGGGACTGCGTGTTTGGAGGGAAATGGGAATGTACCT 
Sequence 91 6 

CCCCGCGTCCGCTCTCTGTCGGGGTCCCCTCCATCTCGCTGCTGCTGAAGGCCGCGAGGG 

CGGCGGCGATGGCGGAGGCGGCGCTGTTGCTGCTGCCTGAGGCGGCGGCGGAGCGGGACG 

CTAGGGAAAAGCTGGCTCTCTGGGATCGGAGACCGGACACGACGGCGCCGCTGACCGACA 

GGCAGACGGACTCGGTATTGGAGCTGAAGGCGGCGGCAGAGAACTTGCCGGTGCCAGCTG 

AGCTTCCAATTGAAGACTTGTGCAGTTTAACATCCCAGTCACTGCCCATTGAACTGACTT 

CAGTAGTGCCTGAATCTACAGAAGACATTCTCTTGAAGGGCTTCACTTCCTTAGGAATGG 

MGMGMAGMTTGMACCGCACAGCAGTTTTTTCTCATGGTTTGCAAAGCTGCAAACT 

CAGATGGATCAAGATGAAGGAACTAAATATAGGAGCAGTGTGATGCTATATTGAATGATG 

TAAACAGTGCTCTTCAGCATCTGGAGTC 
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GGCTGTGGCCAAGAAACGCAGGGACCGCTCTCTCCCCCGGGCTTTCGAAATCTTCACAGA 

CAATAAAACCTATGTCTTTAAGGCCAAGGATGAGAAGAATGCAGAAGAATGGCTCCAGTG 

CATCAACGTGGCAGTTGCCCAAGCCAAAGAAAGGGAAAGTAGAGAAGTAACCACATATCT 

GTAGGGAATTTATAAGTCAGCCATGACAATTATACACCACAGGCATTGTATTATCATTGC 

CMTGTCAAGAAAAAGAGCTAMTTTACCMGCCATGGTTGGNTTTTTACTAAATACCAT 

GGGMTTTGTTGGTCCTTTAAGAAGAAGGGCCTTAAAATGGCAGGGATTTCTTAGTNAAA 

TGNCAATACTCTAACAGCTTTAGTATTGACTTTAGAATATATCTGATGCCCACAAAAATT 

AAATAAAAGGGNTTNGAGGAGGTTTGCCCNAAATAAGTGNGGGGCCCGGAGGGGAA 

Sequence 91 8 

AGTCNCCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTACGGGGGTAGCT 

TTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACCATTTC 

CTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCATTGGCA 

GTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTGAACAT 

GTCCCAGACAGTAACTTTGGTGTAMTGCTTCCTGTTTTTCTGCCACAAGCCCTTTGGTC 

TTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCAGCCTG 

ACACATATGGGACTGCTCGACAGGTCCACTGTCCCACGAGCAGAAGCTGGTCACAAAGCT 
TGGGAAAT 

Sequence 919 

GGGAGTCGACCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTCACGGGGG 

TAGCTTTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACC 

ATTTCCTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCAT 

TGGCAGTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTG 

MCATGTCCCAGACAGTMCTTTGGTGTAAATGCTTCCTGTTTTTCTGCCACAAGCCCTT 

TGGTCTTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCA 

GCCTGACACATATGGACTGCTCGACAGGTNCACTGTCCCACGAGCAGAAGCTTGTCACAA 
AGCTTGGAA 

Sequence 920 

AGTCGCCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCG 

TGGGGGATGGATGACAGTCCACCAGAAAAAAGTTAGTGGAGCGGGGACAAGCAGGGTTGC 

AGAGTGGAAGAAAAATGTTCCTGTGAGAAGAAACTGTCCAAAGAGTNTGAAGAGAAAAGG 

GAACAGGGTGAATTTGANGCCCTACAAGAAAAACAGGAGACCATTCAACAGGAGACGCCC 

AGGGAGCAGGTGGCTTTGTGGGCCTGATGTCCAAGAAAGAAGTNCTGGTGGTAAACAGAG 

ACTTGTGGATTGCAAGCTACTGTTGTCTTTCTATTGAAA 

Sequence 921 

TCGGAGTCGCCACGCGTCCGGCCAGGCGTGCCTGGAAATCCGCTTTCGCAGCGCCCCCTC 

GTAGCCCGCCTCCGCCCGCAGAAGGCGTTCCCTGGACAGAGAAGCGGGCGCGCGGGGGCG 

GGCGCGTGGGGCCTTGCCGGAGMCCTGACTCTCCGCAGCAGCAGTGGAAGCCAGAGTGA 

CGCGTTGTGTTGAACACCAGTTTTCTGGAGCGCTGTGTGTTCTCAACAGCTGAGCAGTCT 

GTTTCTCCAATCAGGTTTCAAAGCCACTTCAACTGCACTGGCCCCTGTGGGTCACTGCTG 

CACCGCCCTGGCCCATGTGGGTCCCTGAGGAGCGACCTGCCGGGGCCACCTGGCTGGACG 

AAAAAGACACACTTTGGGACTTAAGCCGTGAGAAAAAAACTTCATCAGTAAGAAACAAGT 

CAATAGACAAGTAAAAGACTAGGAGAAAATATGCATAAAACATAAAAAGTGACTTGGATT 

CCTGATCTTGGAGTATTTAMGAATTCCTATAACTTANAAAAGGTTTCMGTTTTTTNAA 

ATGAGCAAAANGGTTTGGGTAAA 

Sequence 922 

TCCGCAGGCTGGGGGATCCCAAAGGGTGCGCTCCAGCCCCCAACCCAGGCACTGGGACTC 

TGGTGGCACCCTGGGTGGCAGGCAAGCCTTGAAATCAAGTGCACGAGCCTTGGAAAGGAG 

GACCGGGAGAGTTATGGCATTTATGAATGAAGAAGAGAAAGAGAATCACTCGGATGGGAA 

AAGTTAACTGGATTGTTCCCACCTGCATGGATCACCCGGGTAACTGCAGTGGGACCGAGG 

GGGCGAGGCTGCGGGCTGGGGGATGTGCCGGGTTTCCTTGTGTTGCCACGAACCCAGAGA 
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GGGAGAGGAAGAAGATGGAAAGAAAAAGGAAAAAGGGAAGGAAAGTAAGAGGGGAGAGAG 

GGGGAAAACTTGAGGATGAAAGAAGAGACAGAAGAAAGAAGAAGACCTTGAGAGAGGGGA 

GGAGAAAGGAAAGGAAGCNGGAAGGAGGAATTGGAAAGTGAAGGAAAGGGGGGAAACCAG 

GCNGAGAAAGAAAAGAGAAAGGGGGAAGGGAAAGAAGAAAGGGGAAAAGNAAGGGGGGGN 

GGTTGAAAATCAACNCGAAAAAAAGAGG 

Sequence 923 

CNCGCGTCCGGCTGTGATGAATGAGGTCTAGGAAATAATTTGCATGTGTCTTGGGGGACA 

CAACAGTAACNGAGAGGAAATACATTATTACAGCAACTTGCGACGTACTAATACCTGTCA 

GTGTTGGCCCCCGTAAGGTATGTAAGGCACCTGNGANGTGCCCAGTNAGTNCCTTGGTGN 

MGGCCAACATGTACTAGTTATGTAAGTATTGGTGTCTGCTTTAAAAAAGGAGACCCAGA 

CTTCACCTGTCCTCTTTAAACATTTGAGMCAGTGTTACTCTGAGCAGTTGGGCCACCTT 

CACCTTATCCGACAGCTGACTGTTGGATGTGTCCATTGTCGCCAGTTTGGCTGTTGCCCG 

GACAGGACAGGACCTCCATTGGGCGCAGCAGCAGGTGGCAGGGGGTGTGGCTTGAGGGTG 

GGTGGCAAGCGT 

Sequence 924 

CCCCGCGTCCGCACAGATCCTTGAGCTCCGCTGCAGGATAGTACAGTTTTACCGCAGAGG 

GAATCTGGAACAGTGGAATCATGTGTCTGCCCTGTGTATTGCAGTTTGTATTGCCACAAG 

CTATATTTATAC CAGT GTCACCCTTTTCTTGTAGAATATACTAATAAATCTGTGCCAACT 

CTACCTTCTCACTTTTACCTCTGACGTCATTCTTTTTTTCTGAAAGAGGTMTMTTC 

GTFITGATAGACTCTGAGGATTATGTGMCAGGACATTTTTCATTTGTGMTTTAATGCT 

ATACTGTCAAGGTACTTGCT TGTGTCTG AACTCTAGTGCACTTATGATTTTGTAGACCCA 

TGTGAAATTTAATAAGATACG I I I 1 1 I I I CCTTTCTTTGGTGTGGTAGTGCAGCAACAGT 

TTGGTCTGCAT1TGTTAGMGTTTAACTCCTAACA/.CCCAAAGACCTATTTA 

Sequence 925 

GCGTCCGACCCCAAAGGGAGGGACCACATTGCACACACTGTAAGAAATGCACTTTCCGAG 

GAAGGGGATGGGGGAGCCCGGACACCCAGAGCTCCCCGAGTTGGGGGTGCCCGTCTGGAG 

CGCCCCCGTCAGCCCCTGGCGGTGGGAGGTGAGAGCGAGTGGTTTAAGTGCCTGATTACC 

ACCACCCGCCCCCCCCTTTGTCCAGCTGGGACACGGAATGGCCGCGGGCCTCCTCCCCCT 

CCCCTCCAGCCTCTCCACCAGCCCCTCCAGTCAACCCTCATCGCCGTGCCCCCCCAGAGC 

TAGAGAGATGGGGCCCCTGCGTGGCCCGAGGGGCAGAGCTGGGCGTCACTTCGCAAGCGT 

CCTGCCCTGCCGGGGCGCGGGGGTGGGCTCTGGGGAAGCCGGTGCGCCCCCCACGCCTNC 

GCTGCCAGTGCCTTACATTCTGGAGCGACCCCCCTCCCTGGTGCCTCCCAGCGAAGGGGG 
ACCCGC 

Sequence 926 

AGACAGCTCAAGCCTTGCCACTTCGGGCTTCTCACTGCAGCTGGGCTTGGACTTCGGAGT 

TTTGCCATTGCCAGTGGGACGTCTGAGACTTTCTCCTTCAAGTACTTGGCAGATCACTCT 

CTTAGCAGGTAGGTGCCGCAGACCCTGCGGGTTAAGAGGTGGGGTGGGGGGCAGTGCTTG 

CCAAGGCCCTAAACTGGGAGCGCTGGGTGAGGGGAACAACCCACTTTGGAGGGTTCTCTG 

AGAGATAGATACACCCCATATCCTGGGCCCAGCTCGTGCACACAGCTGGAGGTCCAGAGA 

CCCAGTCCCCTCTGCTCCGTCAGCCAAGTTCCAAGAAGTTGAGCAGAGACCCTTCTGGGA 

GCCTGGCGGGGTGCAGCGGCCTCCCCTGCGGGGCCTGTCACCCGGCCGGGCGCGTGCAAA 

CGCCTCTGGCGCCTNTNTGCGCGGGAGGGGAGATAAGCGTCTGAGCCAGGGAAAGCCGCC 

GGGCTAAAACCCGCCTTTTCCGGGGGCCCC 

Sequence 927 

CGCGTCCGGTCATATACMTGTCATTGTTTGGGACCCGTTTCTAAATACATCTGCTGCCT 

ACATTCCTGCTCACACATACGCTTGCAGCTTTGAGGCAGGAGAGGGTAGTTGTGCTTCCC 

TAGGMGAGTGTCTTCCMAGTGTTCTTCACTCTTTTTGCCCTGC1TGGTTTCTTCATTT 

GTTTCTTT GGAC ACAGATTCTGGAAAACAGAATTATTCTTCATAGGCTTTATCATCATGG 

GATTCTTCTTTTATATACTGATTACAAGACTGACACCTATCAAAGTATGATGTGAATCTG 

ATTCTGACAGCTGTCACTGGAAGCGTCNGTGGAATGTTCTTGGTAGCTGTGTGGTGGCCG 

ATTTGGAATCCTCTCGATCTGCATGCTCTGTGTTGGACTAGTGCTGGGGGTCCTCATCTC 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

148/467 



GGTCANGTGACTTTCTTTACTCCACTGGGAAACCTAMGMTTTTTTCATGATGATTGGG 
TGTATTCTGGGTCACTTTCTCTTGCCATAAGCTATNCTCATTCCAGTAGTTT 
Sequence 928 

CCACGCGTCCGGACGCTGCGTGGAAGCGGCGGAGCCGGAGGGAAGCAAAGGACCGTCTGC 

GCTGCTGTCCCCGCCCCGCGCGCTCTGCGCCCCTCGTCCCTGGCGGTCGCTCCGAAGCTC 

AGCCCTCTTGCCTGCCCCGGAGCTGTCCCGGGCTAGCCGAGAAGAGAGCGGCCGGCAAGT 

TTGGGCGCGCGCAGGCGGCGGGCCGCGGGCACTGGGCGCCTCGCTGGGGCGGGGGGAGGT 

GGCTACCGCTCCCGGCTTGGCGTCCCGCGCGCACTTCGGCGATGGC I I 1 \ ICCGCCGCGG 

CGACGGCTGCGCCTCGGTCCCCGCGGCCTCCCGCTTCTTCTCTCGGGACTCCTGCTACCT 

NTGTGCCGCGCCTTCAACCTAAGACGTGGACAGTCCTGCCCGAGTACTCTGGCCCCGAGG 

GGAAGTTA 

Sequence 929 

CGACGGCCA NGGCG CCTCCGAGTTQCCCGCCAGGACTCGGAGGGCCAGGAGGGCGCGACC 

TGGGTGGATATTTTTGTTGGACGGCGCAACTCTTGGGGTGGCCCGGGAGCGGCGGAAACC 

GAGCGAGAGAACCAGGAGGCGCTGCGCAGAAGGAGGCCCGGGGGCTCCGAGGCGTTGAGG 

GGCTCGATCTGCGTTCTGGGGTTGGCAGCCGAGAGGCCGCGGTCCCTGAGTGCCAGAGGT 

GGTGGTGTTGCTTATCTTCTGGAACCCCATGCAGCCAGATCCCAGGCCTAGCGGGGCTGG 

GGCCTGCTGCCGATTCCTGCCCCTGCAGTCACAGTGCCCTGAGGGGGCAGGGGACGCGGT 

GATGTACGCCTCCACTGAGTGCAAGGCGGAGGTGACGCCCTCCCAGCATGGCAACCGCAC 

CTTCAGCTACACCCTGNAGGGATCATACCAAGCAGGCCTTTGG 

Sequence 930 

CGTCCGCTTTNAGACCGGAAGACATTTAAAAGCCAGTTTACGTACANGMGCATGGTTTT 

AGATTAACTGCCTGTTGGTACAGCTAGAMCATTGCAGCCCTATCGCTTATTTATCTTGC 

ATGTTGCTCTGCTTTGCTATGAAAAATATCGTTrTATGATAAMCTTGTTGAATTTTGAT 

ATGTATTCGGTTATACTCTTAGGGAAAATAATAGAAATTAGAGTGAGAGAAAGTGCTATG 

TATATTAGGCTTTCAGATTTTATAGATATAGGCTTMGGGAGGGTGGAGGTTCTTTTT^ 

MGTTGMTGACTACTTAMTTTGTTGATGTGAATTTAAGTTTTAMGATTATTATTM 

TAACTCTTCTCTTTGTC TTTGC ATTTACCTTCCCAGATGTTCCAGCCTATCATTTTAC^ 

ATTCTCATTCTTGTATTATTTTCATCACTTTCTTACACAACMTATTTA^ 

CTTTTTACACTGG I 1 1 I IGGTAC 

Sequence 931 

CACGCGTCCGTGGAGTATGTGCCATCTGCCAAAGTGGAGGTGGTGGAGGAGCGCCAGGCC 

ATCCCTCTAGACGAGAACGAGGGCATCTATGTGCAGGATGTCAAGACCGGAAAGGTGCGC 

GCTGTGATTGGAAGCACCTACATGCTGACCCAGGACGAAGTCCTGTGGGAGAAAGAGCTG 

CCTCCCGGGGTGGAGGAGCTGCTGAACAAGGGGCAGGACCCTCTGGCAGACAGGGGTGAG 

AAGGACACAGCTAAGAGCCTCCAGCCCTTGGCGCCCCGGAACAAGACCCGTGTGGTCAAG 

CTACCCGCGTGCCCCAC 

Sequence 932 

GGTTCGCCCACGCGTCCGCCCTGCTACCCTGGGAGAAGCCTCAGCTTTCTGGGCAGAGTT 

TGTCTCCCTGTCATTTATACTCTCAGGCTTTATACATTTACACAGTAAGTTCTCCCTCCT 

GGAGGGTTAAAAGGAATAATTTCAACAGGGTGAAGGCCTGGCACGGTGGCTCACAACTGT 

MTCCAAGGACTTTGGGAGGCTGAGGTGGGTGGATCACCTGAGGTCAGGAATTTGAGACC 

AGCCTGGCCAACTTGGTGAAACCCTGTCTCTACTAAAAACAAAAATTAGCCAGGTGAGGT 

GGCACACACCTATAGCCCCAGCTACTGGGGGAGGCTGAGGCAGGAGAATTGCTTGAACCT 

GGGAGGCAGAGGTTACAGTGAGCTGAGATGGCACCACTGCACTCCAGCCTAGGTGACAAA 

GCAGCAAGACGCATTCTNAAAACAAAACANCAACAACAACAAAAAACGGGAAAACA 

Sequence 933 

CNCGCGTCCGGTCCACTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTG 
GTTCTTTCGGCCGTTCACGTCATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGT 
GGCCTCTCGTGGCTGGCGGCAGGTGGGGTGATGGTGCTGGCCTCGGCGCTGCTGTGTGTG 
ATTGTGTCTGTTCTGACCAACGTGCTCGTGGGTGGAAACACCCCAAGGAAGAACCCCATG 
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CATCCCAGCTCAAGGTGGTCAGAGCTAGACCTTCTTATTCTGTTGGGGACGGCGGGCCAC 
GTCTTGAGCCTGGGCGCCAGCAGCTTCGTGGAGGAGGAGCACCAGACCTGGTACTTCCTT 
GTGAACACCCTGTGTCTAGCTCTGAGCCAAGAAACCTACAGAAACTACTTT 
Sequence 934 

TCGCCCCGCGTCCGGTTATTTTACCCAGAAGCCGGATAGAGAAAATATTACAGAGAAAAT 

CACATATCACATGGGCTCGAAAGATGTAGAGGTTTTTGACAAATGAAGAACAACCATAAC 

AGGTAGAGGGAACACCATGAACCAGGGCATGAAACTGAAAGTGCATAACATATTCTAGAG 

AGAGAAGGGTGTGGGCATGAGTTAGGGCTGGAAAAACAGGTTGGAAACAGATAAGTAAGG 

GTCTCAAATGCMTGTCAAAGAGCTTGCAGTTTATTTTCCAGGCAATGAGTAGGCAGCCA 

AAAAAAAAAAGTAAGGATG I I I 1 1 1 I I I I I 11 1 CCCATGGCATCATATTTAAGAGGATGG 

ATTTAMTTGTGTGAGACCAMGCATAGAGACTAGATMGAGGGCGATCATTATTTCAAA 

AAGAAATAATGAAGATCCAATGAAGGAAGTGGGAAATTAAAATAGGGGAAGAGAGGTA 

Sequence 935 

CCGTCCGGTTTTTTGTCTCAGAGTTCTTCAGGCTGTACAGGAAATGTGGTGCCGGCATCT 

GCTTCTGACGGAGTCTCACTCTGTCGCTCAGGCTGGAGTGCAGTGGCATGATCTCGGCTC 

ACTGCAACGTCCGCCTCCTGGGTTCAAGCCATTCTTCTGCCTCAGCCTCCCGAGTAGGTG 

GGACTACAGTGGCCATGTGTCTGAGATCTAACCAAGGGAACATGGGTGGAACTGATGTAA 

GCCACTTTGACACCACAAAACCTCCCATGGGTTCTCTCTCTTCTTCTCTGGTGTACTTGT 

TGGATGGAGAAGATGCTGAGAAATAGTGGGAAGTCCTAGGGGATGGAAGAACCCAGGATT 

CTGMTACTCCATTGGACCTTACGTTTTGGAATCAGGNATGATGCTGGCCTTCATAAAAT 

GAGTTATGGAGAAAGTCCCTCTTTTTCTGGTGTTTGGMCANGTTTTCAGAAANGMTTT 

GNTACCCAGCTTCCNTCTTTTGTACC 

Sequence 936 

CCGGTGAGCGCCCCGCCCGCTCAGCCGCCAGATCAACCTTAGCGCTGGGGCGCGGGCTGG 

GGTCGCCAGGCGGTGCGTTCTGCCCGCGCGGGGCTGAGAGTTAGGGGCCGGGGCCGGATC 

CGGGGCCGGGGGTCGCGCCGCTAGCCGCCAGCAGCGCAGTCCGGGCCGCCACCCTGCACC 

CTCCGCCCTGTTTCTGCACCCGTCTGGGTTCTTGTGCCGCCGCCCGCAAGCCTTCCCGAG 

CTCAGGGTGGTGAGCTGCGGAGACCCGTGATAATTCGTTAACTAATTCAACAAACGGGAC 

CCTTCTGTGTGCCAGAAACCGCAAGCAGTTGCTAACCCAGTGGGACAANGCGGATTGGAA 

GAGCGGGAAGGTCCTGGCCCAGAGCAAGTGTGACACTTCCCTCTTTGACCATGAAACTCT 

NGGGTGTCTGCATTGCTGATGGC 

Sequence 937 

GTCCGCCGGCATGAGCTGTCCATGAAGGATGAGCTGCTTCAGTTCTACACCAGCGCTGCG 

GAGGAGAGTGAGCCCGAGTCCGTTTGCTCAACCCCGTTGAAGAGGAATGAGTCGTCCTCC 

TCAGTCCAGMTTACTTTCATTTGGATTCTCTTCAAAAGAAGCTGAAAGACCTTGAAGAG 

GAGAATGTTGTACTTCGATCCGAGGCCAGCCAGCTGAAGACAGAGACCATCACCTATGAG 

GAGAAGGAGCAGCAGCTGGTCAATGACTGCGTGAAGGAGCTGAGGGATGCCAATGTCCAG 

ATTGCTAGTATCTCAGAGGAACTGGCCAAGAAGACGGAAGATGCTNTCCGCCAGCAATGA 

GGAGATCACACA 

Sequence 938 

CCCGCGTCCGGMTTCCAGTTGTGGATGAAGGAAATGGTGTTATGACTGCCTCAAGGTTT 

TGTAGCAAGTCATAGGGAACCAAAAGAGGAATCTTGTTTTCCTCAGAGGTCATGCCAACT 

CCAACTCCCGTTC CCTAA ACTGTCTCTGAGCCATAGACTAGTAATGGACTCTTCAAGCTC 

TACCATTAGGTATCTTTTAMGAMGCTGGTTATTACTATTTATTCA I Mil I ICTCTTC 

TGTGCAGTGCAAAAGATATGAAACATCGGCTAGGTTTCCTGCTGCAAAAATCTGATTCCT 

GTGMCACAATTCTTCCCACAACAAGAAGGACAAAGTGGTTATTTGCCAGAGAGTGAGCC 

AAGAGGAAGTCAAGAAATGGGCTGAATCACTGGAAAACCTGATTAGTCATGGAATGTGGG 

CTGGCAGCTTTCAMGCTTTCTTGAAGTCTGAATATAGTGAGGAGGAATATTGACTTCTG 

GATCAAGCTGTGNAAGAGTACAAGAAAATC 

Sequence 939 

CGTCCGGCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTG 
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CCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGA 

CCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCG 

GGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCG 

CTCG ATTAACAAAATATATTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGAC 

TTTTCGAATCATATGATCTCCTAAAAGCTGTTCACATTGTTCAGTTTCATTTTTATATTA 

AMCTTGGGACTGCATTTTTTATGGTTTTGTTTC 

ATTACCAAACACCAGATAATTGGATCACTAAAAATTCCTGGTAGAAAAGAATTTAAAGAC 
AAA 

Sequence 940 

TCCGAAAGNGTACTGCCATGANCCGAGATAGGAGACACATAAGAGGACAGCAGAAGCCCT 
GGCCCTGGGGAGGCTTCTCGGAAGGCCTGGCTTCACAGGCAGGCCACAGAAGGATATCGC 
GGGCACCGTGCACCCAAAGCMGATAGTGGCTTCCCTTTTATATCCAATCTAATCCTGAT ' 
TGGATGTCCCTGAGGCCCCTGCTGGAAACAGCCATAGGAGAGGGCCCATGGCAGTAGGGG 
AAAGMGGMGAAATTCCCTGCMCAAMCTTCAGCTAMCTTTGATTTGTGTATTGTTT 
ACATAATAATTTTAAAGGGTACATAATGTGTAAAGAGTTTGGATAGAACCTCTCTTCATA 
CTATGGTTTTCGTAAAGGATCTGTTGTTGTTACGGATTC A I 1 I I 1 1 CCCTCT ATTTTTAT 
AAAGAGCAGCAGAGTTGTCTTCTCAAAACGGCTGCCAAGCTCTGCTTCTTGGGAAGAT 
Sequence 941 

CCCGGTCGTGCGGGTCGGGCGCGGGCGGGCGCGGCGGCAGTGGCGCTNTCAGGTGATTGA 

CTGGCCAGCTGCCTGAAGGAGCGCCAGGTCCTCCTTGCTGGCAGGTGGCGAAGCCCATTG 

GGGCGGCGGTGCAGACCCGCGGCGGCNGCTGCGGCGGTCTGGCTCGGGAGGCGTTCCTGG 

GGCCAAGGCCATGGCCCCGCGGCTGCAGCTGGAGAAGGCGGCCTGGCGCTGGGCGGAGAC 

GGTGCGGCCCGAGGAGGTGTCNCAGGAGCACATCGAGACCGCTTACCGCATCTGGCTGGA 

GCCCTGCATTCGCGGCGTTGTGCAGACGAAACTGCAAAGGAAATCCGAATTGCTTGGTTG 

G 

Sequence 942 

CACCCACCCCAGATGCCGCCTGGCACCAAGCGCAGCCGCCAGCTGCCGCACTTTCCACTT 

GTATTGATCACCTATNANNCCCGCGCANAACGGCTACGNCCGAGCGGACCGCGGCCAGCG 

CGCCAGCCCTTGGCACNCCCTNGGAGCAGAAAGGGCTCCGGGAGGAAACTCCTTGGGAGC 

GCCCTGTCCGGANTGCCCTTTGCTCTCTGCAGTGTGATTTCTTTCTGTTCTGGGAGGAGG 

AGGAGGAGGANGAAGAGGAGGANGAGGNAGAACGANANNCTGCCCTTCCAGAGGTTGGTG 

AGGGAGATCGCGCATGGATTTNAAAACCNACCTGAGGTTTCAGAGCGCAGCCATCGGTTG 

CNCTGCANGAGGCTAGCGAAGCGTACCTGGTGGNTCTGTTCGAAAGA 

Sequence 943 

GTCCGGTTTTGAAACAGAAATGTAGGCATTAGACTTCCTGGGCGGCAGACAAACCAAAGA 

GCGGAAATTCATGCAGCCTGCAAAGCCATTGAACAAGCAAAGACTCAAAACATCAATAAA 

CTG GTTCTG TATACAGACAGTATGTTTACGATAMTGGTMGCTTTCACATTTGATTTCT 

TCTGTTTTTCCAGTAACTGTGAAGGGAAATTGGTAGGAGGTGTTGTAACAGGGCAGGACC 

CAAATGGGAACGGGGGGATGACATTGGTTTGTCAGGTACCGAGCAAAGAGTGAGGATTTT 

GGAGTCTCCCTTCTGCTGCTCTGATGTTTTCCACATGCTTATTTCTTTGCCAGGCACTGG 

AGATGCAGTCAAGAAGTGGGAAGTGGCTCTTACTTCTAGTCTGTGTGTGTATAAGTCACT 

TMGATGGCCGTGTTGACTGCTTCTTTGGGAAATGCCCTGAATAGGAGCATGTAGGGGAT 

GCTTACCGAGGCTGGGGAAGG 

Sequence 944 

GCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGGCCCGACCCCGAGCCCGACC 

CCGAGCCCAAGCCCGAGCCGGAGCCCGAGCGAGACCCCGAGCCCGAGCCCGAGCGCGACC 

CCCGGTGCGGCGCGGCTACCCCGGCGGAGGCGGNGGGCGCGGGGCGCGCTCTGAGGCCCG 

GGGGATGCGCCGCCGCCTCGACCATGGGCGCCGCCGCCTCCAGGAGGAGGGCGCTGAGGA 

GCGAGGCCATGTCCTCGGTGGCGGCCAAAGTGCGAGCAGCCCGAGCGTTTGGAGAGNACC 

TGTTCCCAGAGTCACCCTGAGAACCGCAACGGCCGCAGATCACCTGCTGGCTGATGCCTA 

CTCTGGCCACGACGGGTCCCCCGAGATGCAGCCGGCCCCCCAGAACAAGCGCCGCCTGTC 
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CCTCGTTCTCCAACGGCTGCTACGAGGGCAAGCCTTCTCAGAGGAAGCCCAAGCATTAGG 

AAGCCCGCAGGC 

Sequence 945 

CGCGTCCGGCACGGGGGAGTCTGTGGTGGCCNGTTTACCTGGGCATCTGGCTGAGAGGAA 

GAAAGGCCAACCTGATCCTGAGGGGACCCAGACATATCCTTTGCACTGTCCCTAGAGGGG 

CGATGAGCTTTGCAGCATTAAAAMTGGTGAAGGGGGGAAATATTTTGAACCAAAGACCA 

AATGTTAGGCCGCCGTTATATTTGCAGAAGCTTTGAGAACCATGCGTATAGCCTCCTGCA 

TTCTCCCCTCTCCTAGGAGCTCTTTTGTCTCTGTCCTTACGAGGCGTCATACAGAGGCAG 

TGGGGTGGGCACAGATGAGCAGAGTGGATGGTTCGGTGGGTCCCCACGAGGGCGAGTGGT 

GGTCATATGTGATGGCACCGTGTTCACACACCCTCCTGTGTACCCCCCCAGGGTCACCCG 

AAGTCCCCACACGCTGGCTCTCCACACCCCTCCTGTTCCAGAAAGCATGTCCCG 

Sequence 946 

TCGACCNCGCGTCCGGCACTCX)CTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCC 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACC 

CCTGCCTGGCCCCGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGC 

TGGGTCTTCGCCCAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAAT 

TTMCTTGGAAATCAACTAATTTCMGACAATTTTGGAGTGGGAACCCAAACCCGTCAAT 

CAAGTCTACACTGTTCAAATAAGCACTAAGTCAGGGAGATTGGAAAAGCAAATGCTTTTA 

CAC 

Sequence 947 

ACCCCGCGTCCGCTTTTGCATCTGGATCATTTTTCTCTTGCCCCACCATGTAAGAAGTGC 

CTTTCACCTCCCACCATGAACCTGAGGCCTCCCGAGTCATGTGGAAATCGCCCCCAGCCA 

CCCCACCCCAGAGGGCTACGTCCTGGCAGAGCTGGGTTTGGTTAGTTCTGAGGGCTGAGC 

TGGCCCAGCAGCTCCAGACCTCCAGACCTTGCACTCACCTGTGAACTTGACTCTGCAAAC 

TCCTCCAAGATGCGCCCACCACACTCCCAGTGAACAACACCTACAGGAGCTTGGAGTTCT 

ATTCTCAGATACATCAGCTTCCACATTCCTGTGTGTCCCAGCTGGAGAAGCAAGAAGTCC 

CAGACCATGTGCTAAGCACACGTTGGGGTGGGGATGAAATCCAATTGGTGGTGTGTGAAT 

CCATGCTGGATTCATGAAGCTGAGGCCCAGAGGAGGAAGCTTTCTTAATCMCTTCTTAA 

CATG 

Sequence 948 

TAAMGCCATGGTNATTTGTGCACTGTGCAGTTTCTTATTAGCAAAGGTGCCAATGTTAA 

CAGGGCTACAGCCAATAATGATCATACAGTAGTGTCGCTGGCATGTGCAGGAGGCCACCT 

GGCAGTTGTTGAGCTTCTCTTGGCTCATGGGGCTGACCCTACTCATCGACTCAAGGATGG 

TTCMCAATGCTCATTGAAGCTGCAMGGGTGGCCATACTAATGTAGTTTCTTATCTGTT 

GGATTATCCAAATAATGTTCTGTCAGTTCCCACCACAGATGTGTCTCAGCTCCCTCCACC 

TTCTCAAGATCAGTCTCAGGTGCCACGTGTGCCAACGCATACACTTGCCATGGTTGTACC ■ 

TNCCCAGGAACCTGACAGAACTTCACAGGAGAACTCTCCTGCCCTTTTAGGAGTGCAAAA 

A 

Sequence 949 

CCACGCGTNCGGTCGGCCTGTGCGGCGCTGCGGCGGAGCGGGCCATGGCAGTGGGGAGGG 

GGCGAGTGTAGTGCTGCGCGGGGCAGGCGGGAGGTGATCGAGAGAGGCAGGGATGGGGGC 

GCCGGAGTGGAGCGGTTGCGGCGGNCTGGGCTGCTGACTGCGCACTTGGAATAGTAGCAG 

GCGGCGGCGGCGGMCGCCAGGCAGTGTATGTTTTAACTGGAAAAAGTCTTCCATGAAAA 

CCGTCACTTTTAAAAAAATMGGTMTGCCCATTCTGTTTTTTCCTAAAAAAGACCTGAA 

AAT GGGGG GGCCGAACACATTCCTTAGGGGCCCCCGGTGGNTTATTGAAATGTGCCTTTC 

MGTTTTTCATTAAATGCNCTCCTGGCTTATTGGGCAGGACCATTCCCTTTGAACAATCC 

TGGGGGCCGGGCTGGGAT7TCAACAAGAATTAGGCAATTCTTGGAATGGGCCTTCCAATA 

ACCCTGNTGGGGAATTTTTCCCNTTTTNGCCCCCAACCTTGGGGGMTITNATTATT^ 

MGNTTTGGGGMGGGTTACCCTTCNGGGGGAAANGCTTMCCCAATTTTTC 

Sequence 950 

TTNGGGAGTCGCCACGCGTCCGGCCGGCGACGGCGGCAAGTGGCGGCCCGGCCTGCAGGA 
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GCCCGACGGGGTCTCTGCCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGC 

GGCGAGACATGAGGAGACCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGAT 

GGAGGAGAAATACGGCGGGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCC 

GGTGGACGTACCCAGCGCTCGATTMCAAMTATATTGTGTTACTATGTTTCACTAAATT 

TTTGAAGGCTGTGGGACTTTTCGMTCATATGATCTCCTAAMGCTGTTCACATTGTTCA 

GTTCATTTTTATATTAAAACTTGGGACTGCATTTTTT^^ 

TTTCTTCTGGGAAAAC 

Sequence 951 

NNTCCGGAGTCGACCNCGCGTCCGCGGCTGCTGCCTGCTCTGGAGGCAGGCTGGGCGGTG 

GCGGCCGAGACTGGCGGGGGTGGACGCCCGGGCCGGGCTGCGCCCGCTTCTTGCAGCTGT 

GAATTCCTTTGGACAATTGATGATATTTATCATTGTGCCCAGTTTCTACAAATAAAAGAT 

GGGTGGATTATTTTCTCGATGGAGGACAAAACCTTCAACTGTAGAAGTTCTAGAAAGTAT 

AGATAAGGAAATTCAAGCATTGGMGMTTTAGGGAAAAAAATCAGAGATTACAAAAATT 

ATGGGTTGGAAGATTAATTCTGTATTCCTCAGTTCTCTATCTGTTTACATGCTTAATTGT 

ATATTTGTGGTATCTTCCTGATGAATTTACAGCMGACTTGCCATGACACTCCCATTTTT 

TGCTTTTCCATTGATCATCTGGAGCATMGMCAGTAATTATTTTCTTCTTTTCCAAGAG 

AACAGAAAGAAATAATGAAGCATTGGATGGA 

Sequence 952 

TCNCCCCGCGCCGGTTTTGATACAGMTGAMGTGCGTAGTATTTTCATTTTG 

TTGCCTTATACATATAGCAAGCCCTCAATAAATAAATATTGAATGAATGAATGAGTGAGT 

GMGMTTTGTTTATAACAGTCTGTCATCTTGATMCACTGGMTGTCTTTGGTTCTTCC 

ACTTCATCCTTTATGTTTTAAACTTACACACACCATTCTTACACGTCACTAAAGGAAAAT 

ACCAGTATATATTGGCTAAAA I I I I I I I II I GTTGTTCAAAACTGAAACTCAAATGCCTA 

ATTGGGCTAGGGGTCCTCTTAMGGAGGTTGATGTTTGTCAMTGGGTTATTTTTTAAAA 

GCAGTAG ATMTTG CTTATTTCAAGGCMGTAAATGAATTTAGACTAAGCTGTTCATAGG 

ATTCATCATTTTTTCCCCTCTCCCAAAGTAATTTGTAAGCCGTAAAC 

Sequence 953 

TCGCCCCGCGTCCGTGATTTCTCAGTGTTCTCCTTAGATACCAAATACAAAGGACGAGGG 

AT CMG CTCAGCGAAAGTATCAGGCATTTAAGGTATCAGGCAGCAATGCGGGGAAAGGTG 

MTTTTCTTCAATCAGCATAGGATGGTTAGGGMGAGCATTTATCACTTTGGTTCTTATC 

CTTCAAGCCAGGGGAAAAGCAACAGTGAGGACATCAGAGACAAAAGCATTTATAGAACTA 

ACAMCACAMCGTTTGACAAGTGAGAAAGCTTTATTAAAGCACACATACATGTCAGGGG 

GGTGGGAAACAAAAGAGCAAGTTACAGCCCGGGATCCCAAGTTATGCCTTCCATTACAAT 

TGCAATCCACACCAAATCAATCT7TGAAAACATTCCTCCATTCGGTTCATACATACAGTA 

GAAACCACTGTGGCTGCCCTTAATCCAGTGTGCTTATAGGAAATCAGTTAGCAGCTGACT 

CTGTTGAAAG 

Sequence 954 

CGTCCGGACCCTTATTAAGAATATCCCAGGAAGATGGTGATGAACAGCCTCAGTTTACTT 

TTCCACCAGATGAATTCACTAGCAAAAAAATTACAACAAAAATATTACAGCAGATTGAGG 

MCCATTGGCACTGGTGTGAACMTTMCCAGCAMTGTCCTTTTCTAATACCATTTGAA 

ACTAGACAGCTTTATTTCACATGTACAGCATTTGGCX3CCTCAAGAGCAATAGTATGGTTA 

CAGAACCGACGTGAAGCCACTGTGGAGCGAACGAGAACCACAAGCAGTGTTAGGCGAGAT 

GACCCTGGAGAGTTTCGAGTTGGTCG1 CtCAAGCATGAAAGAGTAAAAGTTCCACGTGGC 

GAGTCACTGATGGAATGGGCTGAGAATGTCATGCAAATACATGCAGATCGGAAATCAGTT 

CTTGAGGTTGMTTTTTAGGAGMGAAGGAACTGGCTTGGGACCCACATTAGAGTTTTAT 

GCTCTGGTG 

Sequence 955 

ACCACGCGTNCGGGCAGAAATACGGCGGCATGTTCTGCAACGTGGAGGGCGCCTTCGAGA 
GCAAGGACGCTGGATTTCGATGCCCTCAGCGTGGGGCAGCGGGGCGCGAAGACTCCTCGG 
AGCGGCCAGGGCAGCGACCGAGGATCGGGGAGTCGGCCCGGGATCGAGGGGGACACCCCG 
CGCAGGGGCCAAGGCCGGNAAGAGAGCAGGGAGCCCGCGCCCGCCTCCCCCGCCCCCGCC 
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GGGGTAGAGATCCGGAGCGCCACCGGCAAAGAGGTGTTGCAGAACCTCGGCCCCAAGGAC 

AAGAGTGACCGTCTNCTTATCAAGGGAGGCAGAATCGTCAATGATGATCAGTCCTTTTAT 

GCTGATATTTACATGGAAGATGGCTTAATAAAACAAATTGGAGACAATCTGATTGTTCCT 

GGAGGAGTGAAGACCATTGAAGCC 

Sequence 956 

CCCGCGTCCGCTA CTGTA CTTTGCAGTTTGATGTTTATTMCATTCTTTGGGCACCTAGC 

TACAATATAACTCAATTTTCTGTG AAAAACTATTAATCATCCTA 1 I I I I I CCTTGTCTTT 

MTATGAGATAAATTTTATACCACTGTTTCTCAAACCATCTG1TGTGAGGGACAGTTTTG 

CTTTTTMTTTCCAATTGTCAGAGACCMTACT1TTGTAAAATATAATTAAAAACAAACA 

TAAAAATAAACTTATTAGAAAAATGAAATAAMGAGAAATGAMTAAGAATMTTTTATT 

ATTAGATTTMCAGATCAAATATTATTTCAATACTCAGATCAAATGTGCAATAAGACAGG 

GTTGCAAAAAATGCACACTTTTTTTATTAAAATCATTTATATMGTMTTTATATAAA 

TMTATTACAGTTGCAACTTTCTGGTGNTTCTCAACTATGACCAAACAGGAGGGTACAAG 

TAAAGGAGCAATCCCAA 

Sequence 957 

GTCGACCACGCGTCCGCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGG 

TGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAG 

AGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCATG 

CTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTTA 

TGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTCA 

GCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAGGAGTGGTGAAG 

GGGACAGTGACACACTTCATGCTGGCTGTCCTGTCCTGGTGGGAGATAAGTTGGGTGGCC 

AACAAGTGGATACATGAGTATGGACAGGAATTCCGCAGACCCTGCAGCTNCAGC 

Sequence 958 

GTCGACCACGCGTCCGCGCCAACTCCGGAGGCGCGGTGCTCGGCCCGGGAGCGCGAGCGG 

GAGGAGCAGAGACCCGCAGCCGGGAGCCCGAGCGCGGGCGATGCAGGCTCCGCGAGCGGC 

ACCTGCGGCTCCTCTAAGCTACGACCGTCGTCTCCGCGGCAGCAGCGCGGGCCCCAGCAG 

CCTCGGCAGCCACAGCCGCTGCAGCCGGGGCAGCCTCCGCTGCTGTCGCCTCCTCTGATG 

CGCTTGCCCTCTCCCGGCCCCGGGACTCCGGGAGAATGTGGGTCCTAGGCATCGCGGCAA 

CI 1 1 I IGCGGATTGTTCTTGCTTCCAGGCTTTGCGCTGCAAATCCAGTGCTACCAGTGTG 

AAGAATTCCAGCTGAACAACGACTGCTCCTCCCCCGAGTTCATTGTGAATTGCACGGTGA 

ACGTTCAAGAC 

Sequence 959 

CCACGCGTCCGAGGGTGGGGAAAGGAGGAGAGGAAGAGCACTCCCTTCCCTGGCCCCTCA 

TCCAGCCTCCGGTGCTGTAAAACGCAGGCGCTGGGCCGCGGGCGGAGCTGAGGACAGGCC 

TTGGCTGGTCCCAGGATGAGCGACGAGTTTGGTTTTAGCTGGGGATTGTGCTGGCATCCT 

GCGAAGCTCCTCCCAGCCGGTCTCTCTGTGCTCGGTTGTCTTGGGGTGGGGCCCATCCGC 

CGAGGTGGGGACCGATAGGAGAAGCCGGTGGGTTGTACCCTTACACTTGTGGAGTCTCCT 

CTTGCCTCTACCTACTCCGCCTTTGTCCTTAAGGTTTTTGCAGGCCAGTGCCAAACACAC 

ACTAACTGTCCTGGCCTCTCCGTGACACAAGTCTCTTCCCAGCCTTCCTC 

Sequence 960 

CCACGCGTCCGCGGACGCGTGGGGCCGGGACAACTGGTCTTATCACGGAGGCTGGGGCCA 

NGGCAGCCCTTCGGTTCGGGTGGGCCCATGGACCCCAGTCCAACGCCGAGGGAATAGGAC 

CATCCAAAAGCGGAACCTTCGCCTCAGAAAAAGGGTGCGGGACCCCTCCTCACCGTGCGG 

TCACGCGTGGACCCTGCCAGCAGCCAGGCCATGGAGCTCTCTGATGTCACCCTCATTGAG 

GGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTNGGTTGGTNGCTGATTCTAGCCTTG 

GTCCTAGCTTGGCTCTCTACCTACTTAGCAGACAGCGGTAGCAACCAGCTCCTGGGCGCT 

NTTGTGTCAAGCAGGCGACACATCCGTCCTNCACCTGGGGCATGTGGACCACCTGNTGGG 

CAGGCCAAGGCNNCCCCGAAGCCAACTGA 

Sequence 961 

NCCCCGCGTCCGGGAGGCTCCATGTTGTCCCCTCAGCGAGTGGCAGCAGCTGCCTCAAGA 
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GGAGCAGATGATGCCATGGAGAGCAGCAAGCCTGGTCCAGTGCAGGTTGTTTTGGTTCAG 

AMGATCAACATTCCTTTGAGCTAGATGAGAAAGCCTTGGCCAGCATCCTCTTGCAGGAC 

CACATCCGAGATCTTGATGTGGTGGTGGTTTCAGTGGCTGGTGCCTTCCGAAAGGGCAAG 

TCCTTCATTCTGGATTTTATGCTACGATACTTATATTCTCAGAAGGAAAGTGGCCATTCA 

MTTGGTTGGGTGACCCAGMGMCCGTTAACAGGATTTTCCTGGAGAGGGGGATCTGAT 

CCAGAAACCACTGGGATTCAAATCTGGAGTGAAGTTTTCACTGTGGAGAAGCCAGGTGGG - 

AAGAAGGTTTGCAGTTGTTTCTGATGGATACCCAGGGGGCAT 

Sequence 962 

GCCCCGCGTCCGCTTCTCCGAATATAGCAACGTCCAGCAGTGTCCACACTGTGGGAACCT 

GGACTACCACTTCGTGAAGCCATTTTCCTCCTTCAAAGTTCTCGAAGCTTATTGATGAAA 

GCTTTGCTTTAGTMTAGCTATTrrATTGATATTATTACTTTATTACATATCTT^ 

GGAMCATTCTGTGACATTAATTTCCTTTCTAATTTAAAGGAGAGTTACTTTGTTG 

TGTGCCACTAAAATAGGGGCTGCCCTTGCCCTGTCTTGATTCCCGAGTGTTAATCTGTGG 

TTTTGACCAGAGCCCAGATGGGTMTCCTGTGCATTTGGGTTGGGGGTTCACTCTTACCA 

AGMTCTTTGATGCAGCTTTAAGATGGTGGGGAGATGGGGGTTGAATTTAGGGAAAGAAT 

NTTTGTGGGTTATAAACTAAGAGCTTGATAGGAGTTGGAAGGAAACTCTTACTAAAATGT 

TMCTITCTAAAAACCTTCTTTTANATCTTNCTTGGGCCTTTGGAAAA 

Sequence 963 

GTGTTTTGGGGATGCCTTTCCnTACCAGATTCTTCTAAAGCCCAGCTGCACCCACCCTTA 

AGTGGGAGATAAGGCTTCTGCCCGCGGGCTCTGCGTTCGTCCACCCGGCCCCACGTTTGC 

TGTGGACTAAACAGGAGCCACTGGACTAGAGTACACTTGACTCTCGGCTCTGCGGACCAA 

AAATTCCAGGACTAAGGAATAGCAAGGTTAGGCTGAAACAGTCCACACAGGGCTTGCGGT 

AMCGTCTTTTCAGGAGCCACTCGCCCAGTGCAGTAAGTCGTGTACTTAGTTGACTCGAG 

CGCTCCAGGGAGACGCCCGACCCTACTCTGCGCCGCCCCGGGGCACCAGCTCTGCTTCCT 

CCAGGTCCACTGAGGCAGGCACGCCCAGCTCTGGGACAGGTCAGTAAACAAGCCACGAAC 

CGCGCCAGGGATCAGAGAACCCANAGTCCCCGCCAGCTGCCGGCACAAGCCAATCGCAGC 

GCANCCAGGCGGC 

Sequence 964 

GTCTAAGGGATCCAGGTCCTGTGTCTCAGGGACCTCTGATGGGATTGAATCCAAGAGGAA 

TGCAGGGGCCTCCAGGCCCNCGGGAGAACCAGGGTCCTGCTCCCCAAGGGATGATTATGG 

GCCACCCGCCTCAAGAGATGAGAGGACCTCACCCTCCAGGTGGACTACTGGGACACGGCC 

CTCAGGAAATGAGAGGTCCTCAGGAGATCCGAGGCATGCAGGGGCCTCCACCCCAAGGAT 

CAATGCTGGGACCTCCCCAGGAATTGCGAGGGCCTCCAGGCTCACAAAGTCAGCAGGGGC 

CGCCCCAGGGCTCTTTAGGACCTCCACCCCAGGGTGGCATGCAAGGACCCCCCGGACCTC 

AGGGACAGCAGAACCCAGCAAGAGGGCCACATCCATCTCAAGGGCCAATACCATTCCAGC 

AACAGAAAACGCCTCTGCTAGGTGATGGGCCCCGGGCCCCCTTCAACCAGGAAGGACAGA 

GCACAGGCCCCCCACC 

Sequence 965 

TGCGCATGCGCGGAGCGCGGCGCGCGCGGCGGTTGGGCCGTTGGCTGTTCGGCCCTGGGA 

TCCGCCGCCACTCCGCGATCAGACCGCTCTGTGCCGCGAGCCGCCGTGAGCACTCGGATT 

CAAGCCGGCGCCAACGAGTCCGGGGGCATCGCCCGCAGCGGCCAAGCTCATGGCCGGCTG 

AGCGGGACGCCGCCTNCGCCTCAGCCACCGCCGCCGCCGCCGNCTTCTTCTCCTCAGCCG 

GCGGCGGCCCGGGCCCAGCAACCATGGCTGAAGACTACTGGGACGGGCGCCTGCGGCGAA 

CAGGAGGAGAAAGGGAGGTCGCGCGGCCTCATTCCGGGCCGCCGCCCCAGGCGCCGCCGC 

GCCGGCCCCGCGGCTCTGAGGTTGCTCGCGCGCCCCC 

Sequence 966 

TGGAAMTNTTTTGGMAMATTACCCTTGGGACCTTGNTTTTNAANCCCNAGGTTCCCN 
GTTNNGGCAMTAAAANAATGNNNGACCCGGGATTTNGGGNTTNNAAACCGGGGGTTnT ' 
AATTTCCCCNNNNCNNGGGNCC I I I I I I I I I NCCNCCCCCNCAAGGGGGNTTTGGGAAAN 
NAAANCCCCCCCC I I I II I I I I NGGGGGNGAAANTTCCCCGGGTNNNNGCCN I 1 1 I I 1 I 1 
TTTTTAAA 
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Sequence 967 

GTCCGCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGAAAGATG 

AGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGAACGCATTTACTGTC 

ACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAATGCAAA 

TTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATGGAGGAT 

AAGMCATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGTAGCTAC 

AGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTTCAGATC 

ACAGATGTGAAATTGCAGGATGCAGGGGGTGTACCGCTGCATGATCAGCTATGGGTGGTG 

CCGACTACAAGCGAATTACTGTGAAAGTCAATGCCCCATACAACAAAATCAACCAAAGA 

Sequence 968 

CGTCCGGGAACTCAGCAACGGTTTCTTCATCCAGGACCCGATTGCTCTGGTGGAGAGGGG 

GGGCTGCTCCTTCCTCTCCAAGACTCGGGTGGTCCAGGAGCACGGCGGGCGGGCGGTGAT 

CATCTCTGACAACGCAGTTGACAATGACAGCTTCTACGTGGAGATGATCCAGGACAGTAC 

CCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGGCCGAGACGGCTACATGATCCG 

CCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTTCCATCCCAGTCAATGTCAC 

CAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTCTGGTAGAAGAGTTTGTC 

CCACATTCCAGCCATAAGTGACTCTGAGCTGGGAAGGGGAAACCCAGGAATTTTGCTACT 

TGGAATTTGGAGATAGCATCTGGGGACAAGTGGAGCCAGGTAGAGGAAAAGGGTTTGGGG 

CCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCTTCCCCAGGG 

Sequence 969 

GATTGGAGGAGTCACATCCCCTCTTCAGCCGCAGCACCCCTCCCTCCCATCCTCTAGCTC 

TTCCCGCGGTGGTCGCCTCCCTCCGACCCTGCTCTCCCCTCCTGGGCCCCGCGCAAAGCC 

CCCTCTGTTCCAGCTCCCGGGCCTCGGCTGCCTGCCCGCCCTCCCATCCCTTCCTCTTCC 

CAGGGCCTGGAGCGCTCCCTTACATTCCTGAGATGCCCTTCCTCGGGGCTGTCCCCCTTT 

GCCTCCCCAGCATCCCATTTCTAGGCCTTTTTCMGACCCTTCCAGAGCGGCCCCTTTCC 

AGCTCCCTTTCTCGTTTCCATTTCCAACmGCCTCTTTTGCCTCTTTG 

TCCMGCTCCCCTCCCTCTTTCCCTTCGTCTTACCCTGCTTTGATCTACGCAGCCCCAAA 

CTCAAGCTCCCCGCTTTCAAGGTGGTGCGAGGTTGTTGGGGGGTGCGGAAGGGCCTGCCA 

AGTCCATTTTTCGAGGGG 

Sequence 970 

GTCCGAGATCGCGAGCCGCCGCCC I 1 1 1 1 I I I I I I I I I I ATAAGATTATTAGTATAAAAN 

GGGGAGACGAGGTTAGGGCCCTGGGAAAGGTGGGAGATCAGCCAGAGACAGGTTTCCCAG 

AACAGAATGTCTGGCCTTTGTGGTGAGGAGGGACTGTGGTATGAGCCGCAGAAGCGGGCC 

AGGGGTAAACCCTCCTGTGCGTCCTTCCTTCAGCCTGGTCCTGAGGGTGACCCTTTGATC 

CTGGGTTCTCCAGGTAGGGCTGTGAGCTGTGAGTTGGATCCTTTTGGTGAAATGGTCTCT 

CTCATCTGGCCTGTCACTCAATGTGGAATAGAGTGAGTGAGTTCTATGGGTTCTAAGTCC 

TGCTCTGGMCCATAAGTMGTTATCCTCTCTGGGCTTCAGTTTTTCATGGAAAGTTGCG 

TTMGMTCTAGTTTAAGGCCAGGCATGGTGGCTCACCGCCTTGTAATCCCAGCACTTTG 

GGGAGGCCAAGGAAGGTGGATCATGANGTCAGGAGATCGAGACCATCCTNGCTAACATGA 

TNAAACCCGTGTCTTTACTTAAAAAATAC 

Sequence 971 

CCTGCCAGTGGTGAGCACCTTCGGCCTCCAGGTGCCTTTCTTCTTCTTCGCGGCCATNTG 

CTTGGTGAGCCTGGTGTTCACAGGCTGCTGTGTGCCCGAAACCAAAGGGACGTCCNTGGA 

GCMATCCGAGTCCTTTTTCCGCACGGGGAGAAGGTCCTTCTTGCGCTAGGTCAAGGTCC 

CCGCCTGGAGGGGGCCAAACCCCCA 

Sequence 972 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGTCAGCCTCCACCTGGAAGAGAGCTANGGG 

CCGGGCAGGCCGGGCAGCTGCCACCCCGCCCGGCCCGACGCCCCGCATGCCCCGAAGTCC 

CTGGCGCCCACCCGGCCGCGGCCCTGCGTGTGACCCGCGGGTCGATACCTGGCAGCCCCA 

GTGCTGGGGCGCCGCGGCCCTGCTCGCCCAGGAGGAGAGCGAGGGCCCCACACTGAGTCT 

CTTGAAGCCTCACGTTTCCCTGGGGGGGTGCTGCATCGTCGGGTGTCCCTCACCCCACCT 
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GGGGAACCTCTGTCTTCAGGTCACCCCTTTTCAGGGGCCTGG 
Sequence 973 

CGTCCGGGACCCTGCTCATGGAGAACATCAGCAGCTGGCTGCTCCTTCGCTGACGCCCTG 

GGCTACGTGAACCTGCCGCTCACCTTTTTCTGCCGGGCAGAGCTGGATAGTGAGCCCGAG 

CGGGTGGCGTCCGTCCTGGAAAAGCTGAAGGAGGACTGNAACAACACTGAGAACAAAGAA 

CGGAAGTCCTTNCAGAAGGAGCTTGTGATGGCCCTACTGAAGATGGACTGCCAGGGCCTG 

GTGGTCAGACTCATCCAGGACTTTGTGCTCCTGACCACGGCTGTAGAGGTGGCCCAGCGC 

TGGCGGGAGCTGGCTGAGAAGCTGGCCAAGGTCTNCAAGCAGCAGATGGACGCCTACNAG 

TCTCCCCACCGGTGACAGGAACGGGGTTGTGGACAGCGAGGCCATGTGGAAGCCTGCGTA 

TGACTTCTTAC 

Sequence 974 

TCACCACGCGTCCGCGAAGCGTGCACCGCTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCC 

GCAAGCCCAACGCCGGGGGCAGCCCCGCTCCGGTGCGCCGCCGGNCGGAGGCCTCGCCGG 

TGCAGAAAAAGGAGAAGAAGGACAAGGAGCCGGGAAAACGAGAAGGAGAAGAGTGCCCTA 

GCCCGGGAGCGCAGCCTCAAGAAGCGCCAGTCGCTGCCCGCCTCCCCACGTGCCCGCCTC 

TCTGCCAGCACCGCCTCTGAGCTCAGCCCCAAATCCAAGGCCAGGCCATCCTCTCCX^TCC 

ACATCCTGGCACAGGCCTGCCTCCCCCTGCCCCAGCCCAGGGCCAGGCCACACTCTGTCT 

CCAAAGCCACCGTNCCCCCGAGGCACCACTGCATCCCCCAAGGGGCCGGGTTCGGAGGAA 

GGAGGAGGCAAAGGAGAGCCCCAGCGCCGCANGGCCCGAGGACAA 

Sequence 975 

TCCGCAGAAACGGACTTTCTCATCATGCTTTCCTATGGTGGGTATGAGGGGCCAGCTGAT 

ACCAACCAACTGGCCTGTATCTATCTATCTGGATTTGACTTGAATTTTTAAAATGTGTAT 

CGTTTAAAAAAAMAATGTTTGCAMTTTTGCACATAGGATCTTGCACTGTTCATTTTCA 

GTGGGGTGAGTCTTCACACTAAAAACACAAGCAGAGCTCCTGGGAAAAGAGACTGGAAGT 

GGTTCAGGATAAAGAGATCCATGGTGGGCAGGGCTCTTAGGTCACAGAGCTCTAGAAGCA 

GCTGGACTTGMCCCACAATGGCTTGTGTAMTTCGTAMTTTCATGGTTTCTAGGAAAA 

GCTGCATTG 

Sequence 976 

GCGT CCGG MGMCTGTGGAACTGCGGTTGGCAGACMTTTTGTTACAAATATTAAAAGG 

AMGATTTTGCCAATATGACCAGCTTGGTGGACCTGACTCTATCCAGGAATACAATAAGT 

TTTATTACACCTCATGCTTTCGCTGACCTACGAAATTTGAGGGCTTTGCATTTGAATAGC 

AACAGATTGACTAAAATTACAMTGATATGTTCAGTGGTCTTTCCAATCTTCATCATTTG 

ATACTGAACAACAATCAGCTGACTTTM7TTCCTCTACAGCGTTTGATGATGTCTTTGCC 

CTTGAGGAGCTGGATCTGTCCTATAATAATCTAGAAACCATTCCTTGGGATGCTGTTGAG 

AAGATGGTTAGC7TGCATACCCTTAGTTTGGATCACAATATGATTGATAACATTCCTAAG 

GGGACCTTCTCCCATTTGCACAAGATGACTCGGTTAGATGTGACATCAAATAAATTGCAG 

AAGCTACCACCTGACCCTCTCTTTCAGCGAGCTCAGGTACTAGCAACCTCAGGAATCATA 

AGCCCATCTACTTTTGCATTAAGTTTT 

Sequence 977 

NCTCCMCMTTATGGCTCATCCTTCCTTTTACTCTGTCTCACCTCCTTTAGGTGAGTAC 

TTCCTTAMTAAGTGCTMACATACATANACGGAACTNGAMGCTTTGGTTAGCCTTGCC 

TTAGGTMTCAGCCTAGTTTACACTGTTTCCAGGGAGTAGTTGAATTACTATAAACCATT 

AGCCACTTGTCTCTGCACCATTTATCACACCAGGACAGGGTCTCTCAACCTGGGCGCTAC 

TGTCATTTGGGGCCAGGTGATTCTTCCTTGCAGGGGCTGTCCTGTACCTTGTAGGACAGC 

AGCCCTGTCCTAGAAGGTATGTTTAGCAGCATTCCTGGCCTCTAGCTACCCGATGCCAGA 

GCATGCTCCCCCCGCAGTCATGACAATCAAAAAATGTCTCCAGACATTGTCAAATGCCTC 

CTGGGGGGCAGTATTTCTCAAGCACTTTTAAGCAAAGGTAAGTATTCATACAAGAAATTT 

AGGGGGAMAMCATTGGTTAAATAAAAGCTATGTGTTCCTATTCAACAATATTTTT 

Sequence 978 

CCCCGCGTCCGGGTCCCCGCGACTCCCGGACTGGAGAAAACGGCTCTTGCGATGGGGCGA 
AGTCCGAGCTGCGGCGGGCGTTGGTCCGTGCAGGGAAGTGGGAATCGTTAGGTTCGTTCT 
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GGACCCGCCGCCCCATGGCCCAGGCGTCTCGCTCAGGTAGCCTGCCTCCACTCGTTATCG 

TGCCCCCGCTGAGGGCGCAACCCGGGGGCACTGGGGAGGAGCAGTGGGAGAGAAGTCGAA 

CGGNCGGNCTTCCGCTGGGCAGAGCTCAGCAGTACTTGGCAGCATGGGACCCAGCTTCCT 

TCCTGCTCCTGATCCAAAAGGACTTACCTNCTCTGTTGCATGAGGCAGAAGCTTTGTATA 

GCCTGGCCTCAGAGGAAAGCTTAGCTCTGGAAGTGGAGCAGCAGCTGGGCCTGGAGATCC 

AGAANCTGACTGCACAGATCCAGC 

Sequence 979 

AGGCTGNTACGAAGCGAGCTTGGGAGGAGCAGCTGGCCTGCGGGGCAGAGGAGCATCCCG 

TCTACCANGTCCCAAGCGGTGTGGCCCGCGGGTCATGGNCAAAGGAGAAGGCNCCGANAG 

CGGCTCCNCGGCGGGGCTGNTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 

GGTGAAGAAAGAACCGAAAAAGAAGAMCMCAGTTGTCTGTTTGCAACAAGCTTTGCTA 

TGCACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCANAT 

CTACCTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTTCATCATCCTGNTTGTGGG 

CCGANCCTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATNCCC 

Sequence 980 

ACCCCGCGTCCGGAAAAGAAGAGTGGCCNGTTCCAGGGGTAGCTCCAAAAGAGACTGCAG 

AGCTGTCCGAGACCCTGACAAGGGAGGCCCAAGGCAACAGTTCCGCAGGAGTGGAGGCAG 

CAGAGCAGAGGCCTGTGGAAGATGGCGAGAGGGGCATGAAGCCAACAGAAGGGTGGAAAT 

GGACCCTGAACTCCGGGAAGGCTCGAGAATGGACACCCAGGGACATAGAGGCTCAAACTC 

AGAAACCAGAACCTCCAGAGTCAGCAGAGAAGCTTCTGGAATCTCCCGGTGTGGAGGCTG 

GAGAAGGGGAGGCTGAGAAGGAGGAGGCGGGGGCTCAGGGCAGGCCTCTGAGAGCCCTGC 

AGAACTGCTGCTCTGTGCCCTCCCCCCTCCCACCAGAGGACGCTGGGACTGGAGGCCTGA 

GACAGCAGGAAGAGGAAGCAGTGGAGCTTTCAAGCCCCCACCACCAGCCCCTCTGTCTCC 

CCCACCCCCAGCCCCAACTGCCCCCCAACCTTCTGG 

Sequence 981 

GCCCCGCGTCCGAAAAGAATGGGTGAACCAATCGGCCTTTGTGMTTTATTCAGTGCCTT 
CTCTGTACCAAGCACTGGGTAAGGCACTTTTGTGGAGCATTAGACAGTAACCCTCAAGGA 
GCTA GAGAA CCGGATGGGAGACATGAGCGGTAATTAACTCACTTGTTCCCCAGAGTTTCT 
ATTTGTTTTGATTTTCTTmCTGTGACTTATTTTC 

TTCACTATGGCCCMCTMTATAMCACCTGGAMTTACAAGGAAAAAAAATTCTTCCTC 

TMTMCTTTCCAM7TTGTGGMTATTTATTTGTAATAGCAGTTATCAGTTATGCTTAT 

ATAGCATTAAAAATTCTCCTCCTTTGACTACACACACAACCACAGTGTGGTTCTAATCAT 

GGAGATATCAGTMTTTTTAGTMCTGMTTTTGAGGACATTTCTNTGTTTAGCATGTAT 

GCAAACTGATATGTAATCTGAGGTTCCAAAGTCA A I I H I I I CI I < I I I 

Sequence 982 

TNGGGAGTCGACCCCGCGTCCGGTTTTTGTGAGGCAGTGAGACCTAAGGTAACCTTTATC 

AAAAGGATGGAGTTGGGAAAAGGAAAACTACTCAGGACTGGACTGAATGCGTTGCATCAA 

GCAGTGCATCCGATCCATGGCCTTGCCTGGACCGATGGGAATCAAGTTGTCCTAACTGAT 

TTGCGGCTTCACAGTGGAGAGGTCMGTTTGGGGACTCCAMGTCATTGGACAGTTTGAA 

TGTGTCTGTGGGTTGTCCTGGGCCCCACCTGTTGCAGATGATACACCTGTTCTACTCGCT 

GTCCAGCATGAGAAGCATGTCACTGTGTGGCAGCTGTGTCCCAGCCCTATGGAGTCAAGC 

AAATGGCTTGACGTCTCAGACTTGTGAGATTAGGAGGGATCACTACCTATCCTTCCCCAG 

GGCTGTGTGTGGCACCCAAA 

Sequence 983 

GTGTCGACCCCGCGTCCGCGCCCTGCCTGCAGTTGAGATTCAGATGCCTTCTGACAGAGT 

TCAGCCTCTTGGAGAGTCTTGGGGATTGTTGGCACCTAAACAGAATCAGNGACCCGGGTG 

CTTTGTGGCCAGCAGCACAGAATCAAACCCGCATCCCAGCATTGGGCCACCCATCTGAGG 

GAGGCCAAAATCATCACAGATGCTGCTGTGCTGCAGACAGATACATGCTAGTCCAGAGAG 

CCGCCCCTGAGATGGCTGTGAGAACCATGTGTCTAAGGCGTAAGATAAGGATGGAAGGCT 

GTCCMGTTATTTGGAAGGCCTCGGCAGCTTGGGATTAGCTTGGGAGCGCAGCGCTGCAA 

AGTGGAAAATATGAAAAGACCACACAGGCCCAAGCAGTCCAGAAACTGGGCAAAAATATT 
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CTGCAGTGGGGATTT A I Mi ll 
Sequence 984 

CACG CGTCCG GAGTACGGAGTTGTTCCTTTACTGGCTGAAAGATATATTCGAATTGTAAA 
GATGCTTTTTCTCATGCATTGAAATTATACATTATTO 

TTTTTTTCTCCCGAGACAGGGTCTTGCTCTGGCGCCCAGGCTGGAGTACAGNGGCATGAT 

CTTGGCTCACTTCAGCCTTGACTTGGGCTCAAGTGATCCTCCTACCTGAGCCTTCTGAGT 

MCTGGGACTACAGGTGTGCACTCCTCGCCTGGCTMTTTTTTATTT7TTGTACAGGCAG 

GATCTTGCCACCTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCATGCCATCTGCCTGCCT 

TAGTCTCCCAAAATGCTGGGATTACAGGAGTGAGCCACCATGCCCGGCTGGCAGTTGCAT 

GGAAGAGMCACCTNTTTATGGCTTACCCTCTAGMTTTCTMTTTATGNGNNCTGTTGA 

AATTTTTGGI M i l l I ACCT 

Sequence 985 

GTCGACCACGCGTCCGCTCGGCTTCGTGCTGATGGTCAGGGTTTTGGCAACTCCCCGGTG 

TGAGAGGGGTAGGGAGTGCTCCCGGCGGCGACGGGGCCGAGTTCACCAGCCGCCGGGGCA 

GTAGTCGAAGGCCCGGCGCGGCATGTCCTGGGTGCCGCGGTGCGGGCAGTGAACGCGCGC 

CGGGCGGGATGGGCCGGCGCCGGGCGCCAGAGCTGTACCGGGCTCCGTTCCCGTTGTACG 

CGCTTCAGGTCGACCCCAGCACTGGGCTGCTCATCGCTGCX3GGCGGAGGAGGCGCCGCCA 

AGACAGGCATAAAGAATGGCGTGCACTTTCTGCAGCTAGAGCTGATTAATGGGCGCTTGA 

GTGCCTCCTTGCTGCACTCCCATGACACAGAGACACGGGCCACCATGAACTTGGCACTGG 

CTGGTGACATCCTTGCTGCAGGGGCAGGATGCCCACTGTCAGCTTTCTGCGCTTTCAGGC 

ACATTAACAGCA 

Sequence 986 

CGCCACGCGTCCGCGTACGCGTGGGCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTG 

GGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTG 

GGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCA 

CAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACC 

ATTGATAGAT7TMTMCTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGAC 

TACTTTAGGTATTACMGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAG 

CCAGTGTGGAAGAAGGGACT 

Sequence 987 

GGTCGCCCCGCGTCCGTAGCAGTTACATCTACGAGGCTATTATGGATTGGAGGATGAGAA 

GGGAACTG CATGT ACCTCAACAAGGCGTCGGTCAACACCGCGAAGTTTGGCAGGCTTGAC 

AAGTGGAGTTTTTGMTCTATMTGGTTCMGTTTTGAGACAGGAAGAACAGCTGAGAGC 

AAAAGAAGAAAAAAGGCTTCGGGAGCAGGAAAGAAAAGAAGCAGAAGAAGCTAGTCAAAA 

GGAAATAGAAGAATGGGAAAGAAAACTTCTAGCTCAAGCAGCTCCAACTTGTATGGAGAC 

CATGTGGGAAATT CCAG CTATTGGGCATTTCCTTTGTTTAGCTCAGCAAATTCTAMTTT 

GCCAGAMTAGTCTTTTACCGAACTGGAACCGTTGTCTTCTGATGCCTCAGTGTAATGCT 

TTTCTATCGAAAATAATGACTTCTTATTTAAGTCCTCCCCATCGCAGA 

Sequence 988 

NCCCCGCGTCCGAGTCCCCTGTCTGTGGCACCAGACACTCCCGACTGTGCGCTGACTCTC 

CCCGCCCAGCCAGCAGCCTTTTCCAGAGAGGCTGTGGTCCATAGCCTCTGTTCGTTTTCA 

CTGCAGGACCAGGCACGAMGTTMAACAAAATGAAGATriTTTCTGAATCTCATAAAAC 

AGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATGTCGAGTT 

TGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATATCTAAATC 

ATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATGATATATT 

TCCTTTCAMMGCTGGTGAATTTTATTGTGAGTGACTCTGGAGCACATGTTTTAAATTC 

TTGGACTCAAGAAGACCAAAATTTACAGGAGCTAATGGCAGCATTAGCCCGCTGTTGGGC 

CTCCTAATCCTCGGGC 

Sequence 989 

GTCGCCCACGCGTCCGTTCGTTGTCTGATGGACCTGCTTGCAAAAGGCCAGCTCTGTTGC 
ATTCCCAATTTTTGACACCACCTCAAACACCAACGCCCGGGGAGAGCATGGAAGATGTTC 
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ATCTCAATGAACCCAAACAGGAGAGCAGTGCTGATCTGCTTCAGAACATTATCAACATTA 

AGAATGAATGCAGCCCCGTTTCCCTGAACACAGTTCAAGTTAGCTGGCTGAACCCCGTGG 

TGGTCCCTCAGAGCTCCCCCGCAGAGCAGTGTCAGGACTTCCATGGAGGGCAGGTCTTTT 

CTCCACCTCAGAAATGCCAACCATTCCAAGTCAGGGGCTCCCAACAAATGATAGACCAGG 

CTTCCCTGTACCAGTATTCTCCACAGAACCAGCATGTANAGCAGCAGCCACACTACACCC 

ACAMCCAACTCTGGAATACAGTCCTTTTCCCATACCTCCCCAGTCCCCCGCTT 

Sequence 990 

GTCCGGCTGGGACCTCCTCCTGTTGGGGTCCCCATGMCCCTTCCCAGTTCAACCTTTCA 

GGACGGAACCCCCAGAAACAGGCCCGGACCTCCTNCTCTACCACCCCCAATCGAAAGACA 

ATGCCTGTGGAAGACAAGTCAGACCCCCCAGAGGGGTCTGAGGAAGCCGCAGAGCCCCGG 

ATGGACACACCAGAAGACCAAGATTTACCGCCCTGCCCAGAGGACATCGCCAAGGAAAAA 

CGCACTCCAGCACCTGAGCCTGAGCCTTGTGAGGCGTCCGAGCTGCCAGCAAAGAGATTG 

AGGAGCTCAGAAGAGCCCACAGAGAAGGAACCTCCAGGGCAGTTACAGGTGAAGGCCCAG 

CCGCAGGCC 

Sequence 991 

NCGCGTCCGCTTAAATGACTCGTTATCATTTTGCAATGAATGGAAAATCATTCTCAGTGA 

TACTGGAGCATTTTCMGACCTTGTTCCTMGTTGATGTTGCATGGCACCGTGTTTGCCC 

GTATGGCACCTGATCAGMGACACAGTTGATAGAAGCATTGCAAAATGTTGATTATTTTG 

TTGGGATGTGTGGTGATGGCXaCAMTGATTGTGGTGCTTTGAAGAGGGCACACGGAGGCA 

TTTCCTTATCGGAGCTCGAAGCTTCAGTGGCATCTCCCTTTACCTCTAAGACTCCTAGTA 

TTTCCTGTGTGCCAAACCTTATCAGGGAAGGCCGTGCTGCTTTAATAACTTCCTTCTGGT 

GTGTTTAAATTCATGGCATTGTACAAGCATTATCCAGTCTTCCAAGTGTTACTCTGCTGT 

ATTCTATCTTTMGTMCCTAGGAGACTTTCCAGTTTCTCTTAATTTGATCTGGCAATCT 

TTTGGGTAAGTGGGTATTTANAAT 

Sequence 992 

TTTTCACTGCAGGACCAGGCACGAAAGTTAAAACAAMTTGMGATTTTTTCTGAATCTC 

ATAAAACAGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATG 

TCGAGTTTGATATGCTGGTGMGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATAT 

CTAAATCATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATG 

ATATATTTCCTTTCMAAAGCTGGTGMTTTTATTGTGAGTGACTCTGGAGCACATGTTT 

TAAATTCTTGGACTCAAGAAGACCAAAATTTACAGGGAGCTAATGGCAGCATTAGCCGCT 

GTTGGGCCTCCTAATCC 

Sequence 993 

CGCGTCCGGGCAGGAGCACCACTCAAGGAGCTACACCCCTTGCATCGGCTTGACCGCCTT 

ACCTCAGGGGTGCTTATGTTTGCCAAGACAGCTGCAGTCTNTGAGAGAATTCACGAGCAG 

GTTCGGGACCGGCAGCTGGAGAAGGAGTACGTGTGCCGGGTGGAAGGGGAGTTCCCCACT 

GAGGAAGTGACCTGTAAAGAACCCATCTTAGTGGTGTCTTACAAAGTAGGGGTGTGCCGT 

GTAGATCCCCGGGGCAAGCCCTGTGAGACAGTGTTCCAGAGGCTAAGCTACAATGGCCAG 

TCCAGTGTGGTACGGTGCCGGCCACTCACAGGCCGCACACACCAGATTCGAGTCCACCTT 

CAGTTCTTGGGCCATCCCATTCTCAACGACCCCATCTACAACTCAGTTGCCTTGGGGTCC 

TTCTCGAGGCCGGGGCGGCTACATTCCCAAGACAAACGAGGAGTTGCTACGGGACCTGG 

Sequence 994 

ACGCGTCCGCGACCGCTGGGCATGCGGGTGTTGGCGCGGTATCCCCGCCCTGCCCAGCAT 

CTGCCCCACGTTTCTTCAGGCTAAACTACCGGGATCCCGGGCTTCTTCCTAAAGTAAAAC 

TCGCTCCGGAAAGGCCAACAGTCCAGCGGCCAGACGGGCACCTGGGAACGCGGGCCTAAC 

GCGTACTGGAGACGGAGTGGCGCCCGGCACTGCGCGCCTCCTCCCCGCCGGGAGACTGCG 

TGCTAAGCTCAGCAAAGCCCCGCTGTGGAGACGGAGCCATGTCGCCCATTACCTAATGAA 

ACTGAGAAGGGAGACTCAGTCTCTCTTCTAGCCCCGAGCGCAAGCTCTGCTGGACTTGGC 

ATCGTCCGCCCTCCACGATCCCACACTCCGGGTTTTCCCCATTCCCAGCTCGGCTGCAAC 

CGAGAGACAGACGGAAGAAAC 

Sequence 995 
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TCCTCCTGGCCCTGTTAATGTCGGGGCCNGGCCGGGGGAGGATGGCGCCCTAGAACCCGG 

CCTTGCTGGGGTAGGGGCGGGAGGGGACGGGGTGGGGACCGGCCATGTCGGAGGTGACCC 

GGAGTCTGCTGCAGCGCTGGGGCGCCAGTTNTAGGAGAGGCGCCNNACTTCGACTCTTGG 

GGCCAGCTGGTGGAGGCGATAGACGAGTATCAGATATTAGCAAGACATCTACAAAAGGAG 

GCCCAAGCTCAACACAATAATTCTGAATTCACAGAAGAACAAAAGAAAACCATAGGCAAA 

ATTGCAACATGCTTGGAAT 

Sequence 996 

CGCGTCCGGCCTGAGCCGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCC 

GCCACTTCCGGGGTCGCAGTCCCGGGCATGGAGCCGCGACCGTGAGGCGCCGCTGGACCC 

GGGACGACCTGCCCAGTCCGGCCGCCGCCCCACGTCCCGGTCTGTGTCCCACGCCTGCAG 

CTGGAATGGAGGCTCTCTGGACCCTTTAGAAGGCACCCCTGCCCTCCTGAGGTCAGCTGA 

GCGGTTAATGCGGAAGGTTAAGAAACTGCGCCTGGACAAGGAGAACACCGGAAGTTGGAG 

AAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGC 

CGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTTCAGGTGCAGAAGCA 

CTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGGCCTTATTGT 

CAACAAGGCGCCCCACGACTTTCAAG 

Sequence 997 

GTCCGGCCAGGAGCCAGGCCGAGCGGGAGCTGACCANGGC7TGACTCGGGTACAGAACGA 

GGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCAGTGGATGGAGGAGGCCCAGCCCTGA 

GGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCCTACAGGGTGGGTAGGCGGGCGTGCC 

GCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCCGAGGGGCCTACGCTGCGGCCCGGCA 

ACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCCCACCCGATCGGAAGGCGGATCCTTT 

ACCAGGGCCATAGGCCAGTGACTGGGCGGGCCCCTTNGGGCCTCCCA7TCGGGGCCCGGA 

CTANGGAACNAGGCCCGGNNGAGGCCCCTTGGCCTACCAGACCCTTTTNTNANGCCGACA 

GCCGNCANGGAAAGAT 

Sequence 998 

CGTCCGGCCCAGAGCCCGCAGCACGCCGCCGCCGCAGCCTAGGTCACCTCCAGCATCTAG 

CACAACGTCTGCAATGGAACAGGCGAGCTGTGAATATTTGTGGAATGCATGGGTGGACTA 

AAGACCTATCACCTCACTCTAGAATGCCCAGCATGTTGGAGCATGAGGACCAAGAACCAT 

GGTGTTCCTCCACTCATCAGAGCCGTATCATTTTGATGCATGCGCCAAGAAAGAAAATTC 

AATCATCAGACTGAAGCAATCAAACTCCAAATGGTGCTGTAAACTGAACCACACATAGAC 

ATGCCATTCTTCTAAGGACCCTTAAGATCCACCCCAGGAGGAGCGCTAGCTGCTGTTCCC 

CATTCGATGCCCCTTTTCGGCCCGGAAGTAGCCGGAAAGATTCGCCCGCCAAAATTCCCC 

TAACCAGCAAGTTAGGTGTGGCATCTTCCACAAGCANGGAGCCGTTGTAGGAAAAAGGNG 

GTCTTGGGGAAGGTTTTTCG 

Sequence 999 

CCCGCGCCGGCAGTTTCNATGGTGTGTACTAATTTGAGAAAATGAATGTGTATACATACA 

AGAGTMGTCAGATTGTTAGACTCATCCCTCAGTATTCCATATGTTTTGTGACTGATTTT 

ACAGTTCTCTCTACCTTTCTCATTTACAMMMAAAAAGAAMGMAATTTGATTCAGC 

AATTCCTAAAAGTATTGTATTCAGTGACATCTTTGGAMCACCAGTTTCTGTTATCAACT 

TCAMTMTAGTCMGTTTTATGTATGATCTAMGGGMMCMGTTTGTTTTTCAATCC 

TGTGATAAT TTTTC TTTTAGMTGAGGTGTTGCAAGAAATGGAAAATTAAAAAAACCACT 

CTGTMCAATTTTGCTGTGCTTCTTTGATTTTTCTCTG1" I I I I GTTTAATGGGTACCTTA 

TATTTGTACCTTTACATATTGMTTCATGAGGAGAGGTTATGCACAGCCTAGTTATTTGA 

CATTCCAGGGGGTTTAAAAAAAA 

Sequence 1000 

CCCGCGTCCGGCGGTGGCGGTGGTGGCGGTGGCGGCGGTGGCGGCGGCGGCGAAGGGGGC 

GGAGAGGAAGGAGCGCGGCGGGACCGGGCCGGGACAGCGCGTACTTTGGGCTCCGGGAGT 

CGCTCCGCGCCCGCGGTTGTAGCAGCTGCCGCTGCAGCCATAGCAGCAGGTCAGTCATTG 

GCACCATGAACTGGAATAMGGTGGTCCTGGCACTAAGCGAGGATTTGGCTTTGGAGGTT 

TTGCCATCAGTGCTGGGAAAAAGGAGGAACCCAAACTCCCACAGCAGTCCCACAGTGCCT 
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AGGMGMGATTCTAGCMCGTTTGATTTACCTTACATTCCTGCT 
Sequence 1001 

CCGGCCGCCGCCGCCGCCCCGCCGCCACCGCCTGGGGGTTGGTTGAGGCGGACGGCGGGG 

TCCGGGCCGGAGTACGTCGTTCCCGCTGCGCTAGGGGAAGCGGGCAGTCAGAAAAATGGG 

TAAGAAGAGTCGAGTAAAAACTCAGAAATCTGGCACTGGTGCTACAGCAACTGTGTCACC 

AAAGGAAATCTTGAACCTGACCAGTGAGCTGCTGCAGAAATGCAGCAGTCCGGCGCCTGG 

CCAGGAAGAGTGGGAAGAGTATGTGCAGATCCGGACTCTGGTTGAGAAMTACGAAAAAG 

CAAAAAGGTCTTGTCCGTTACTTTTGATGGAAAAAGAGAAGATTACTTTCCTGATCTAAT 

GAAATGGGCCTCTGAAMTGGGGCTTCTGTCGAGGGTTTTTGAAATGGGTTAACTTCAAA 

GAAGAGGGGCTTTTG 

Sequence 1002 

GTCGACCACGCGTCCGACGCACCAAAGGGCAAATACTCGGTAGCGACTCAGAGGGAAAGT 

GGGGTCTCTCCTGGGAGAGCAGGAGGCTGCCAGAAAAGAACTCAGGTCAGGGGTGCATAG 

GCGGCTGAGGAGTGCGGGACGGGCTGAGAGTTGGGGTGCCTCCCCGCCCGCAGGTGGGTC 

CGCAGATCCCGCGGGCCGCATTGGCCCGGCTGCTGCGGGATGCCGAGGGGCTGCAGGAGC 

TGGCACTGGCGCCGTGTCACGAATGGCTGTCAGACGAGGACCTGGTGCCGGTGCTGGCGC 

GGAATCCGCAGCTGCGGGAGTGTTGGCGTTGGGCGGCTGCGGGCAACTGAGTCGCCGGGC 

GCTTGGGGCTTTGGCCGAGGGC7TGCCCACGCCTGCAGCGCCTGT 

Sequence 1003 

CGCGTCCGCTTTN(X)TTCTTGGTTCCACCTCAAACATCCCTTCCGMGTGAGGCTTTCCC 

TGACTGGGGAGCATAAAGTAGCATCTCTCACATNCCATACACCCCTACAACGAATCTATG 

CAATGGCCCTGCTCTGCCATCGCCACCTGAAACCATCTCAATAAACACATTTTGGATAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 

Sequence 1004 

ACGCGTCCGTTGGCTGCGAGGAGCGCCCGAAAGGTCAGAGGAAGGAGCTGTGGGAAGCTC 

GCAGCAGGTATCGGAGCTTAAGCCAGTGGATTTGGGGGCCCTGGGCTCCCTAGCCGGCTG 

CGGTGTGAGAATGGAGTGGGCAGGAAAGCAGCGGGACTTTCAGGGCTCTGGGTTTGCAGA 

GAGCCGAAATGACCATGACTGCCAACAAGAATTCCAGCATCACCCACGGAGCTGGTGGCA 

CTAAAGCCCCTCGGGGGACTCTGAGCAGGTCTCAGTCAGTCTCTCCACCTCCAGTTCTCT 

CCCCACCAAGGAGTCCCATCTACCCGCTCAGTGATAGTGAAACCTCAGCCTGGAGGTACC 

CCAGCCACTCCAGCTCCCGGGGTGCTCCTTAAGGGACCGGCACCCCCCACT 

Sequence 1005 

NCCACGCGTCCGGCAGCGCTGCGACGGGACCGCGCGATTCCTCTCCCACGCATCTGGCCC 

GCGTTCCTGGGCCTCGGCACCGGATCCCGGCGGGGGTGTGGACCCAGGGCCCACTCTCCC 

CGGCGCGGCCAGGGCCCCCCAGCGTGCGAGCGCCTAGGGGATGCCGAGCTGCTCAAGATG 

AGGAGGTGCGCGGGGCCGGGGCGGAGCAGTCGCAGTTCCCCGCGTGTGAGCCCCCACCCA 

TCCCTGGCGCCAGCGCTTTCCCGACCACTCGGGTTCGGCTATGCGGGAGCCGNGAGGAGG 

AGGCTTGCAGTCGTGGACCTGAGACCTCGGGAGGTCATGCTGTCTTGCTTTTAAGTGGCT 

TNGGGGAAAGTGAAAGAAAACNCCNAAAATTGGAGGACTTTGCTACCAGGGACCTAACGG 
CACCAGTGG 

Sequence 1006 

ACCACGCGTCCGGGAAAAGCCCGGAAGTGCCACGGGACTTCCTGTCTAAGGAAGAGCCTC 

GTGAAGCTCCTCCACTGGGGAGTCAGTGGCCTTCGTTGTATCTGCCCCGCTTGTCCACCT 

CCTAGAGTGAATCCCCGCCTGGAGGCTGGGACACTAACCAAGAAGTGGCACATGGCATAT 

CACGGGAGCAATGTTGCCGCTGTACGGAGAGTGCTGGACCGAGGGGAGCTGGGAGCAGGT 

ACTGCCTCCATCTGANGCCGTCCTTTGMGGGAGAACCTGGGGTAGGGTTCGAGGAGCCN 

GCGAGAACTGTGCACCTCCTCGGGAGGAGCAGCCCCCTCCTGTGCTGCTTTCCCCCTCCC 

TTCAATATGCTGGGGGCGGAGACCCTGGCCTCCAAAGTGCAATTCCGGGACCCCAAATCC 

CAGCGGACGCACCAGGCTTTAGGTGGGCGTCAAGTTGNTGTGTGCCCCCTGGCTTCTACA 
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CCCCGGGACCCCCTTCCG 
Sequence 1007 

TCGCCACGCGTCCGGAAAMTTTATGCCTTTTTATTCATAACCCAGCTGTGGACCACTGC 

CTGAMGGTTTGTACAGATGCATGCCACAGTAGATGTCCACATMTAAAATTCATAGTTA 

CCMT GCAG TTTTGATATATCATTGGATTCTGTCTTTGAGTTGTAGGTTATTTCTTAGCT 

GCATGTTTTAMCTGMTTTGCATAGAGTTGTATGTTAATGTTTCAGTTAAGAGAAAAAC 

TTAAGATACATGAGTCATTACATAATGGGTATGAAATCTTTATAATCACCCTTCCACCCT 

CTATGGTGTCAGTACACATCACGTGTCATAGATACTTAAMTGTAAATGTTMCACTTTT 

CCTTCCTGCTGAGGATGTTTAGAGCCTAGTGCCAGACCCATTCATTTCCTTTTGATT 

Sequence 1008 

GCGTCCGGGCNGCGGAGTTTTGTCCATAACGTGGGCAACCGCGCAGCTGGAGGATGGCCT 

CACTCGGGCCTGCCGCAGCTGGGGAGCAGGCGTCGGGGGCTGAGGCGGAGCCGGGCCCCG 

CGGGGCCGCCGCCGCCGCCCTCACCGTCCTCTCTGGGGCCCCTGCTCCCCCTGCAGCGGG 

AACCTCTCTACAACTGGCAGGCGACCAAGGCGTCGCTGAAGGAGCGCTTCGCCTTCCTCT 

TCAACTCGGACTGCTGCGATGTGCGCTTCGTACTGGGCAAGTTGGCGGNGCCGCCGCCCG 

CTGGGGGCCCGCAGCGCATCCCCGCCCACCGCTTCGTGCTGGCGGCCGGCAGACGCCGTC 
TTTG 

Sequence 1009 

GCNCCCCGCGTCCGTTAGAGCTCAGGAAGTTATTAGGTGCAGCCTCTGGAGCCATACTCA 

CGCTGCAGTGCATMTGGGAAAATTAGGAGCATTAATAAGAAATTTCAGTAGTGTTTGTA 

AGGAAAATAAGCTACTTACTGAGATCTGTTTCTTCTATTGCATGTTTGCTTTTGAGGGAC 

AGCTT CTGTCA AAAGTGAMTCATCACCAGMCTGGGCCTGTTAGGAAGAATAGGGTTTT 

ATTTACTTTTTATGTCAATTAACTTCAACAAAAAGGCCACGCTGGCTGCTGTCATGCCAT 

CTGGGTATGCATTAAACATTMTGATGATCAGCCTTGAGGTTCTATTTATCTTGATTTGG 

CTTTATAAAGTTTGTCAGAATGGTGCTGGAGGNCCAGAAGTGCTAAGGAGAAAGAAGCTA 

TGGGCCMGTTAAAGAATTTGMTGCAAAGGCCAGGNATGGGAGTTTTCATAA 

Sequence 1010 

CGCCNCGCGTCCGGTGAGCCCCAGCAAGGAGATCAAGATCGTGTCTGCCTGAGGAAGCAG 

AGCCATGACAATCGGAAATCTACCAGCTCAATGTCCTGCATGTAGACTACCGGACCGTGA 

GCAATCTGATTCTGACGGGCCCACGGACGATTGTCATGGAAGTCATGGAGGAGTTAGAGT 

GCTGAGCTCCTGGGCCTCCCAGCCCTCCAGTGGCCTGTGGGTGAGGGAAGCCAGAATGAC 

ACAMGCMTGCAMGACMGATTGCCATGCAMTGGATGGTTTTGGACATACGAGTCTT 

CTCCGC ACATAC ATGTCTAMGTTGAGTTTTATACACTGGAATGTGGAAGAACCCGGGTA 

TCATATCTTTTTTAAAAAATGTCCAGTGTAGAAAACATTTGGGAAAC 

Sequence 1011 

ATTTTTCTMCATGGGTTTGMCGCTTATMCCAGTTTTATAAACCCCTTGAACACTGCA 

GTGAGTTATCAAAGCCACTGCCTGCAMGTGGATGATTTAAGATnTACACGCATGAAAA 

TGAGTGTGCCA TCTCC TGACCAGTGCCTTTTGACTTAGGTACCCAGATGCCACTTGTCAG 

CAGCAGGATACTTTTTACAACACGAMGCATMTTATTTTAGAAGAAGAGAGTAGAAGGG 

CAGMTAGMTTCAACTTACAGAAGCACCGGAGTAGTGTTGTGGTTGGCTGTTATCTGTC 

CCCCTGGGAGGAGGGACTGTTTTGCTCCCTTGTTTTNGATGTTAAACAGTAGCTTAAAGG 

CTTTCCCCCCCATACCAACTTACAGNCMATGACMAGMCCGGTGGNGGTTTTCAACAG 

ATTCTACAAACATGCATTTTTCCCTTCCCACTAAATGGG 

Sequence 1012 

GTCACCNCGCGTCCGCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCC 

AGCCCCCCACACTAGGCTGTTCCATGCAAATGAGGAGGAGGAGCCAGAGAAGAAGGAGGT 

ATCGGAGCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGA 

GGCCCTCAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTT 

GGAGGTGGACCTGCTGAAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATC 

AGGCTCCACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCT 

AGGCCTGGCACTCACCCATTCCTTCGGCCCCAGTCTTGCAGACACAGACCTGTCACCCAT 
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TGGATGGCATCAGTACTTTGTGGGTCCAAAGAGGGGAAGTGACCTTCGGGTGGGTGGTGA 
AGG 

Sequence 1013 

CGCGTCCGAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTGC 

AGGGGATGGTATGAATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATT 

GTCTCAGMGCAMCAGTTTTGCAGGTAAAAAAGNAAAGATNTTAGTGACTTTCTTGGGT 

CTTTATTGAGAAGCTTTCCNGAGNAAGCCACTCTCAGAMTTGGCTTCATATGTTCCCTC 

CCGCCCCCGGAGTAAGNAGCCCTCATAGNGGGATTGACATAAAAGTTGAAAAGNTTGNGG 

AACGGAAAGMTTCNTTCTTTCTGGCAGNMTTTTCTTTGAAAAAAACCNGAGGGGCCCG 

CTTTAGAAAMGGGTACCCTTTCAGGAGGGATTTGTTAAAATTCANTCCTACCCCATGGT 

TTMCAAGGGACTCCTTGACNGTTCAGTMGTAGGTTTCTTTGNNAAAAAAGGAAAGGAA 

GCTGCCCACCGTGCCCGTNGGGGATACCCCNAAGACAATTGANGTTGGGTGGCTTGGNTC 

CTTTNCTCAMGGAATGGTTTTCAAANCAAAAAGCCCACCAAGATTGCCCNGTCCANGCC 

AAAAAAT 

Sequence 1014 

GTCGCCCCGCGTCCGCGGNCGCGTGGGGTGCTNGTCACCAGACTGCACCCTTGCCAGCAG 

CTTCGCAGCTCTCGAAGTAANTTATCGCANGATGGCCGGCGCCTCACCTAGGAGAACCAG 

GMGGCAGGCCNCGCTAGAACGACGGNATTGMTTTTACTATTGNCAAAACAATCACATT 

CAAATTCATTCCACTTAAAACCTGAAAACATTGGACCACACAA 

Sequence 1015 

AGTCGACCACGCGTCCGGGCGGAGGGAGCGTGACTGCGCTGCGCAGGGCGCTAGGAGGCA 
TTGTCGCCGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACA 
CCTGTGTCTGCATGCTGAAGAAGATGGGTGAGGCCGTGGCCAGAGTAGCAAGGAAGGTCA 
' ACGAGACGGTGGAGAGCGGCTCTGACACTCTGGACCTGGCCGAGTGCAAGCTGGTCTCCT 
TTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAGATCCACCTCATCACCC 
TGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTCC 
GAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCCCCAGCGAGGGTCAGTGCCCTG 
CAGCACCTCAAGGCCATTGACCTGTCCCGGAAACCAAGTTCCAAGGACTTTCCT 
Sequence 1016 

CGCGTCCGCTTTCAGTGAAGAAAAGGGAATTACACATNGAATCGACACATCAGTAATACC 

GATACAGTGAAATGGGCCTCTMTAAGMTTTNAGCGNGTTTTCTGATGTGCCATTTTTT 

TTGTCTTTTTTAAAAATATACCATANTTATAAAANTGGNAAATANNTTTTTGNACACCAT 

TTAAATTGACCCCCTTANAGNACNCTTGCCGTNATGNTGAAANGCTAGACCTATNGAAGC 

TGNCTTGANGATATNTGTTTTTTTAAAAAAATTT^ 

TAATATGCACTATAAAATATGATCNTATATCCTATTATCTATNATCTAAAAACACTTCCT 

T GGACNC ATTTANACGTAAAATTAAAAATGGGTCTTTAANGAAGANTAATGGGGAGGCCC 

CTTTTTTAAMCCTATGGNNCMTCTTTTTATC 

Sequence 1017 

GCGTNCGCTGCGCCCGTGGGACCGGTGAAGTTCTGGCGACCCGGTACAGAGGGGCCAGGT 

GTAAGCATCTCTGAAGAGAGACAAAGTCTGGCTGAAAACTCTGGGACAACGGTTGTTTAC 

AACCCTTATGCTGCCCTTTCCATAGAGCAGCAGAGGCAGAAGCTGCCGGTATTCAAGCTT 

AGGAATCATATTTTATACTTGATAGAAAATTATCAGACAGTGGTGATTGTTGGTGAAACA 

GGATGTGGGAAGAGCACACAGATTCCTCAGTACCTTGCAGAAGCCGGCTGGACAGCTGAA 

GGAAGAGTGGTAGGAGTGACCCAGCCTCGAAGAGTGGCTGCTGTTACACATGATCTTTCT 

TNCCAAAGGTTGCAGGGAGAGTAGCTGAAGAAAGGGGTGCAGTGCTGGGCCACCAGGTGG 

GCTACTGCATCCGCT 

Sequence 1018 

AGTCGCCCCGCGTCCGGTGGGAATCTTTCNACTTCTTGATCCATCTGGGAGAGAAGACGT 

ACCATTATGTGCCCGAATTCCGAAAAGTGTCCATAGCAGCTACCATCATCTATGCCTATG 

CCTGGCTGGTTCCTCTTGCACTCTGGGGTTTCCTCATGTGGAGAAACAGCAAAGTTATGA 
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ACATCGTCTCCTATTCATTTCTGGAGATTGTGTGTGTCTATGGATATTCCCTCTTCATTT 
ATATCCCCACCGCAATACTGTGGATTATCCCCCAGAAAGCTGTTCGTTGGATTCTAGT 
Sequence 1019 

GGAGTCGCCACGCGTCCGGTGGCACGATCTTGGCCCACTGCAAGCTCCGCCTCCCAGGTT 

CACGCTATTCTCCCGCCTCGGGAGCTGGGACAACAGGTGCCCGCCACCACGCTCGGCTAA 

TTTTTTGTATTTTTAGTACAGACGGAGTTTCACCGTGTTGGCCAGGATGGTCTCGATCTC 

CTGACCTCGTGATGCACCTGCCTTGACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA 

CTGCGCCCGGCCMTAATTTTTTTAGTTTAAGTTCA I I I I I GTCCCCGCTGATGAAAGTT 

ACAGCAGCTGCCCAGTGCCTCTGTCCACACCCACCTCCCTGGTGTACTTGCCCCCTACAG 

CAGCAGCCCAGATCCTCCCCTGGATTAAAATTGCAACTGGTGCCCTAACCCAAAACTTGA 

GAAAAAATTCTCTATCACATCCACTCTTCTGGCATTTGTAGAATCTC 

Sequence 1020 

GTCCGGTNGATATATATATTTACCTCCTTAGTAATGCAAGAAGTGTTTGTGGGAAGCAGA 

GMGCAAGCAACTGTATTTCTTGTTCTCACCTAAGCATTACTGGAGGATAAGCCACATCA 

GTCTACAAAGAGGTTTTCATACAAACATAATAAGATGTAAAATGGACCAAAAGTTGAAAG 

CACATTCTTGCMGTAAGCACCTGTTACTCTCCMGCAACCATGGGTTTACCATATTTGG 

GGA TTTTTT GAACACTTAGNCACTTTCTTGCTCCCNAAGGGGACNTTTACAAAAAGTGNA 

NACATTTTTGTANTGTNNCCCGTTATTAAAAAAGCTAA 

CAAAAGGGCTTGGTNTTTTTGGACAAATTCAAAAATGGAAMATGGATTTTCCACCGTTT 

TMGCCTCMTTTTAAGCCCCAAGGTTCCCAAAAATTT7TTMNGAAAAMGTTATTCGG 

GTATTAGGTNGGNCCTGGGTTAAAAAACCAAGAANAAAACCATTNAAAACAAAGCCCATT 

CAAAAATTCTTNTGGAAAAAAA 

Sequence 1021 

CAGGGAGTCGACCCCGCGTCCGAGCATCTTGGGGAATTTATATTCCTTTGTGAGAAATGT 

TTTGATCATAAGCCTAGAATGATAAGTAAAAGAAATAAAGATAATTCTACTGCTTGTTCT 

CACCCGGTTACAMGCATGAGTTTGAAGACAATMGTGCCTTGTCCACATTTTGCGAGAG 

ACAACAGTAMATACTCCMAATACGTTCTTTTCATGGTCAGTGTCAGCTTGATTTATGT 

CGACATGMGTTCGGTATGGCTGTTTAAGGGMGATGAGTGCTTTTATGCCCATAGTCTT 

GTGGAACTGAAAGTCTGGATAATGCAAAATGAAACAGGTATCTCACATGATGCTATTGCT 

CAAGAGTCTAAACGATATTGGCAGAATTTGGAAGCAAATGTACCTGGAGCGCAGGTCTTG 

GTAATCAAATAATGC 

Sequence 1022 

CNCGCGTCCGCAAAGCAAACGACAGCTCAGGGGGCTCCAAAGACCTCATTCATAGCAGCA 

AAAGGAACTCAGGTAGTCAAAATATCAGTACACATGGGACGTGTCAGTTTAAAACAGGAG 

CCCCGGAAGAGTCATAGTCCCAGCAGTGACACATCAAAACTAGCAGCTGAAAGGGACTTG 

AATGTGACCATCAGTCTTAGTACTGATAGACCAAAGCAGCGATCACAGGCAGTAGCAAAC 

GAGAGGGCACACCCTGCCAGCACAGCAGTGNCGAAGTCTGGGGAAGCCATGGCCTTAAAC 

AAAACTAAGACTCAGAGCAAAGAAGTCAATGCAAATAAACACAAAGCCAATACGAGTCTT 

CCTTTTCCTMGTTCACTGTCAATTCAAATCGCTTAAGGAAGCAATCTATTAATGAGACA 

CCTTTGGGAAGTTTGTCAAAGGATGATGGAGCTAGAGGGGCTCATGGGG 

Sequence 1023 

CNCGCGTCCGGCCAACCGCCGAGGAGCAGTGCCCTATTCAGCACAGTGCGCCAGGGCCAC 

TGGCAGA7TGTTGATCTTTTACTCACCCATGGAGCTGATGTCAACATGGCAGACAAGCAG 

GGCCGCACTCCCCTGATGATGGCTGCTTCCGAAGGCCATCTAGGAACCGTGGACTTTCTG 

CTTGCACAAGGTGCCTCCATTGCTCTTATGGACAAAGAAGGATTGACAGCCCTCAGCTGG 

GCTTGTTTGAAGGGCCATCTCTCAGTAGTACGTTCTCTGGTGGATAACGGAGCTGCCACA 

GACCATGCTGACAAGAATGGCCGTACCCCACTGGATCTGGCAGCTTTCTATGGCGATGCT 

GAGGTGGTCCAGTTCCTGGTAGATCATGGGGCCATGATCGAGCACGTTGACTACAGTGGA 

ATGCGCCCTTTGGATAGGGCAGTGGGGTG 

Sequence 1024 

GTCGCCCCGCGTCCGAGAAGTCAGGGAGTGGAGGTTCTATAAGGAATTAACAGCTGAGGA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 
165/467 



CGGMGGGTTTGTTTCCCGTTTGAACCTAAACGCAAGTGGAAAAGAATACTCAGAATGTA 

TTTTTCTACTTTACATCTGCTGGGGAAGGAAATGTGTCAGGAAGCCGCTGCATCTGGTCA 

TTTCATCGCATCAGAATCACAGCAGACGTGGAAGATTCCATGTGGTGGGGAATAAAGAAA 

TAACTTTATGCTCTCCTGAAAAACAGCGGGAGCCTATGTGTGTGTGCGACACTGTAATCT 

CAAGGAGATTCACTCAGAGCTGTCTCAGTCCAACTCCTGCATGACCAGATCTTCCCTTAG 

CATCTTTTCTGTGATGAMTATTATCTTGTGTTAGAGTTAGGAATAGGAACTAACCTGTA 

GGAGCATGTCCCCAAATGGACATTTGAATGGACTAACAAAAACAACTGGAAAGACTGAAT 

TTCCGACACAAAGGAATGATGGGATCAAAAAGAAAGC 

Sequence 1025 

GGAGTCGACCCACGCGTCCGGTCGACAGCCTCCGCCACATCCTCCACCTCTCTTGGTCCA 

GCGAGCGTTGCCGGGCCAGGGTCAAGCGGAGGGCTCCGACGGCGCGGACGGAGCGAAGCG 

CCGAGCCATGGCGCACCAAACGGGCATNCACGCCACGGGAAGAAGCTGAAGGAATTCTTT 

GCCAAGGCACGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTNGAGGACCGAGCAGT 

NTCGTGCCTGGGTGCNCTTCGCAAGGGAGCCAGNTAAGGCNCGCTNGGGGATCAGGGACT 

ATTGAACAGGNGGCCCGTGGCTTGCNCACCTGCNTGGGACCGCCCCAGGCAGGCCCTGCN 

TACCTGGCTCTACCGCGCTTCGACTNACAAGAAATGGCTCAGGGGNCTTTCGAAATGGGG 

CTTCTTTCCTTCGCCCTTGGTTCCGCNCTNGAATAAACCTCCCCCCCGTGGCGGCTTGAA 

AGANTGCCTGTACCGCCGGGNCAATGCNGGGCCCCACAAGTGGAAAAAAGGGAAG 

Sequence 1026 

AGGGAGTCGACCCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGG 

GCCGCCCGACCCCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGGNGATTCTGCTGC 

TTTTCCTGACGGAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATTAACCGC 

GGAGATCTGTTCTCCTGCCCCTAGGACTACGGACCCTGCCGGGCCCTACNTTCTCCGTTT 

ACNTACTACGACAGGGTACACCGCAGTAGCNTGCTCGCCAGTNTCCTGTTACNGGNGGGC 

CTGCNGAGGGGGCMCCGCCCAACCAATTTTCTTACACCCTGGGGNAGGGCTTGCCGAAC 

GAATGCCTTGCTTGGGAGGGATTAGNAAAMAAGGTTTCCCCAAAAGTTTGCCCGGCTGG 

CAAGGATGGAAGTGTTGGGACCGAACCCAGGATGTGAAGGGGGGGTTCCACCAGAAAAAA 

AGGTTATTTTTCTTTTAATCTTAAAGTTTCCANTGGMCATGGTNGAAAAAAMTTTC^ 

TTTTNCGGGTGGGGGTGGTTCACCCGGGAAC 

Sequence 1027 

CGTCCGTAGTCTCTCTCGTGGCCCGGAAAAAAAAAAGAAAGAAGGTTGGGGCCAGTCACC 

CCCACATCCCTTTATGGAGGCTTCCAGATCATGGATCCTGTCACGCGCATCCCGGTGAAG 

AGAGTCCACCAACAGGCTTTGTATGGGGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTG 

GTTCGATATGGCTCACTGCAGCCTCAGCCTCCCTGGGATCAAGTGATCCTNTCACCTCAG 

CCTCCCAAGTGGCAGGGACCGCAGGAAGGCCTGGACGACGGCCCGGACTTCCTCTCAGAA 

G AGG ACCGCGGACTTAAAGCAATAAA 

Sequence 1028 

CGCGTCCAACCCCTTCCTCAGCCTGTCGGGAGCAGAGGGCAGTGGCGGTGGCCCCAAAGG 

AGGGACCGCTGACAAAGGAGCCTCAGCCAACCAGGAAAAAGGCTAAATCCACCCTTACCC 

CTCCTGACCCCCCCAAGTGGAGGGAACAGATCCTGGCCTGAGGGGTCCTAGCCTGGAGCA 

GGCGCCTGCGCCCAGACCCTGGAGAGCCTTGACCCAGAGCCTGTGCTGAGGTCCAGGGAG 

TGTGGAGAGCTCCTGGTGTCGAGGACTGAGACTGACAGGGGAGCCCCCTCCATCTGGCCC 

CCTTCCCTTTCCGCACTGTCCGCTTTGTGAGGCTCAGAGGAAGGACAGTCTGCAAGCCCG 

CCTAGGAGGTCCATCCCCAGCAAATGTTTTGGAGGTCCCCCCAGAGAGCAAAGTGGGCCA 

TGGCAAGAAGTAGGGGGTTGGTTGGACCTGTCACATGAAATGGATCAACACTTGAATGGG 

GA 

Sequence 1029 

CGTCCGGAGAAGATGGGCCTCCCGGGCTCAGACTCACAGAAAGAGCTGGCCTGACCACCA 
GGCACCTCACTGGCACTGCTGACCCATCCCAGAAACACAATCTCAGGGACCCGAGCAGCT 
CCAAGGACGAGAGGATACAGCAGACACAACCTAATAGAGAGGGCGCCTGCAGCCTTAACC 
TCCACGGCCTTCGATACTTATGCAAGCCTGGTGTTGCTCCTGTCCTCAGAGTCATCCTGC 
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GCTCATGCCTTTTCCCGMTGGGTTCACCTCTGGCAGTTGCCGCTTCAGTCTTGGCCTTA 
GCCTCATCTTGAAGTGGGTAGCTGGCGGGAGAGGGTGGGCTGCGCCCCCTGCTGGCCCTG 
AGGCTGCAGAGTTGGGAGCAGGACACCTCACCTGAGTTTCAI 1 1 1 I I I I CATGTCCAAAC 
CATGCACATACTATAGTCCAGAATCAAAGCACTTTTGAAAA 
Sequence 1030 

CCCCCCCGCTTTTTTNNANNCNCGGGGGGTGNGGCAAAACTTTTTCCTAGAATTNGCGCC 

ATGTTGGAAACNNTNTGNCAGCANCCCGGCTGCACAGGGNGTAGCCCANCCTCCAGGTCC 

ACGGAGTGGTGTGGACCTCCCACCTCACAGCTGCCTCTGGCAGCCAAGCCTCTTTTCGCC 

CGGCCCCAGCCCCTCTGGTTGATAAACGGGTGGGCCTCCTCAGCAGCGTGGCTGCCTTTC 

ACCTTGATTTCCCCAGGGCTCTCGGCAACATCGATAAACCAAGCCTCGCCCACCAGCTGG 

GCCCTCCCCCACCCAGTCTGCCAGGCTGGGAGCTGGAGCTTGCTGAGTCTTGAATGCCCT 

TCTAGATGGCTTCTCTAGAGGCTCTCCTGGCAAGAGAGGGTCCCAAGGGGAGCCCTGCAA 

AGCAAAGGCTCCTTGTCTGGGGCGGGATAGAGAATCTCGCCTCTGTCTGGTGTGTTACCT 

ACTGGGGGCACAGGAACMTTTCCTCAAGGAGACAGTGGCATGGAGCTTTGAAAGACGAA 

GTANGTGTTAGCAAGGAAATMGGAGGAACGGGGGTTACGGGCAGAGGAGAAAGCACATG 

CCMGTCAGCAAAGAAAAGTAGMTTCGAAAAACTTTTTTA 

Sequence 1031 

GGCCAGAGCTACTACGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGC 

CGCAGCCCGGGAGCGGAGCAGCCGGTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGC 

CGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTT 

CCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCA 

GGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCTGGMTATCTGCCCCAGTTGTCTTTTT 

TGTGGACTATGCTTCCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCG 

AGATTATATTCAGTCCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGC 

CMGACAATTGGGCAGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCT 

CACCACCTCCAACATGCTNCTGAAACCCCCCTGGAACAAGCTGAACATTGTGCTTATAGA 

CT 

Sequence 1032 

TCG CCCCGC GTCCGCAATTTCTTTTGMTTTCGATCACTTCTACATTCCCAGCTTGCCAC 

ACTCTTTTTTGATGAAGTTGTGAAGCAGATGGTAGCTGCCTTTGAAAGAAGAGCATGTAA 

GCTGTATGGTCCAGAAACAAATATACCTCGGGAGTTAATGCTTCATGAAGTCCATCACAC 

ATA MGGC AAAAMGAACTGGTGCCACCTGCTTCTGACTTTAGTTTGTTCACTTTTAGGA 

AGTATTTTCATGACATGTTTTCAGAAGCCAGAAAGCATTTGTTAMCGCAGCTTTGGTTA 

TAMCCTGCACCATTGAAAATTTGCACATAGAATATAGACTCACTTGTACATAGAATTAT 

TTCTTCAAGTATAATTCAAAATAATATGGACATTATCATGTTCTGCATTACAATAATGGG 

ATGTCATCACCATTGCTAGAATACTGGCATGATTCTTCTGAGCAGAAGTTGAAACTGTAA 

ATTTAAACCTTTTAA7TATCACCTTACCT 

Sequence 1033 

NCGCGTCCGGCCTTTTGTTCAGCTTGCCGGGCAACCGGCCCCTGCTGGGGACTACAAGTC 

CCGTAAGCCTCCGCGGCGGCACGTCCTACCCTACACTGTCCAGCCGGCTCCCTTTTTCCC 

CCTCCCCGGGGGCCAAGGGCTCCGGCTGCTGCCTGGCGGCCAACGGGCCAGGTAGGATTT 

CCGGGAGAGGCTGCTGTGGAGGCTGAGGAGGCGGCGGCGGAGATCTGGAATGAATTTTCT 

ATCTGTGAAATTGTGAAGACGMAAAAGAAAATTGTGCTAGTTTTGCTGTTGTACTCCAC 

ATTGCAGAAGTTACAATCACAGTGCACAAAACAGTATCTCACCTCCCTAAACTGGTTAAT 

AGTGGCATGGAAGATCCATTTGAGGAAGCAGACCAGCCCACTACAGAGCCAGGCATGGTC 

CTGGACAGTGTGGAAGCAGGAGACACAACACCTTCTACCAAAAGGAAGAGCAAGTTCTNA 

GGCTTTGGCAAGATCTTTMGCCCTGGAMTGGAGGMAAAAMAAGTAGTGATAAATTT 

A 

Sequence 1034 

CGTTTTNNTTCTTCCCTTGGATNMTATTTTCTACTGAAAAGAAATGAAATCTCAGTTCC 
ACTATGATAAAAGAAACGAATCACCAGTGTCTGTCAGTGCCCCCACTGCTCCTCCTCATA 
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CCAGCATTGACACTGGTAGCTGAGGAAGGGGACTGCTCAATTTCTAATGTGATCTATTCA 

AGMGCCACATATAMAAGGCATTGAGGGCTCACTGTCCAGAGAATTGCTTTTGTAAATG 

CTCCACAACGTATGGGCAAAAGTTTAAGGACTACTGCCTGATGTACCAGGAATCCTGAGT 

TCTGTGAAAATCTGTTCATTCCAMTCTGTTAAGCATTTCCAAACATCCAGAGAATGGTT 

TCACTGTTGAGGGTGCATGTGGCAGAATCTGTCTCTCTCTGCACCTGTCTTCTGTTACCA 

TCCCTGGACAGTGACAGATTTTMGCCAGCCACCAGAATTCTTTTCGAGTTMCCATTTC 

TTAGTTTCTGTAAA 

Sequence 1035 

GTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCCTCCCTCCCGGGGGGTG 

CACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTC 

CTGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCC 

GGCAGCACACAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCCGCCGTCG 

Sequence 1036 

GTC CGCC CCGCGTCCGG GAAAA TGCCGCAGTTTGTTGCCTTGAAACCTAAGAGCAATCCT 

TGGTTTTGTTGCTACATTATTTTTCCAGACCMCACATCTACCAAGTNGMTTTTATNNA 

CTTTMTTTCATAATAMGTTAGTAGAGTCACTCMCTTACAAACTTTATTTATGNTGGC 

TTT GGGC AAAAAAATCACTTATAAGGCAGCTCTAAATTTGCCTTGATAAGCTAAATAAAT 

TACTTTrATAACTTACTAMGCAGMCAAACAGTGAAACTTTCTAAAATATTCTATNCTG 

GAAATAGN GGACAG GGG GATCTTTTAT TTATAATNCTCATCAAGATGAGTGAGGTTGTTC 

ACCAGGATATTTTTTATGG I II i I I II I MTTTTTCTCCAAAG N AATT ATTTTTAATAGG 

AATTCCCAAAAGNAATCAAGAAATTAGTTTTTCAAAAATAMTTTTTTCCAGNTNGATA^ 

MAGGAAGGTNGTTTGTAAAATTAMTCCATTATTTTACCACCTTAAAAAATTTGGGGGG 

MTACCATTCTAAAGGGMCCTTTTMTTCTTTACCTAATTCANGNTAGGGGNGTCTTGG 

CMTTTNGAATANTTTTCNTTTT 

Sequence 1037 

GCGTCCGAAAATATTTAGGTAATGTCATAAAATTTATTTTACCTTTCTCATTTTCTGAG 

AMTAAATGAAAAAMCCCTAGATATTGCTTTATTACCAACAGTGTGTAGGTTTTTGTAC 

ATATGGAAATTTGACACAMAAAATAGGGAATTTGTATAGAGAAGTTTCCCTCTTATAAA 

AGGACTCCCATTTGATTGTTCGAMCTATAAMTGGACTTTTACTTTACCATATCTTGAA 

ATGACAAAATATCGCCCTTTGGAAMCCTGACTCTTTGCACCGTTGTAAATTCCCAGAGT 

CTACCT CAGTT AACCAGGC CTTA GTTTTAGGCMGGAATGMTTGAATTAAMTTTCAGT 

TCMTCATTTTATGCCAGMTTTTGTTTTTCTTT^ 

CTGGNTGGCCCTTCCCTCCCCATTTAACTTTTCTTTTTTTMTNCTTTGAAMATTGGGT 
TTAAAAATATTTCCMTCCTTTTCCTTTTCCTCTAGCCAMGTNGGTTITGTNATTTNCN 
AAATAAAAGGGCCCTCTGGTGGAAAANGGGNCTGGAAATTAACCTT 
Sequence 1038 

CGCCNCGCGTCCGCAAGACTTTGAAAAATNNGATCATGGTTCTTCTCAAAATACCAGCAT 

GTCTAGCATCTATCAGMTTGTGCMTGGAGGTTTTGATGTCCAGTTGTTCACAGTGTAG 

AGCTTGTGGAGCTTTAGTTTATGATGAAGAAATTATGGCTGGATGGACAGCAGATGACTC 

MATTTGAATACAGCTTGTCCATTCTGTAAAAGCAACTTCTTGCCTCTTCTCAATATAGA 

ATTCMAGATTTGAGAGGTTCTGCMGCTTTTTCCTGAAACCAAGTACCTCTGGTGACAG 

TTTACAAAGTGGAAGCATTCCATTGGCAAATGAATCCTTGGAGCACAAACCTGTATCCAG 

TTTAGCAGAACCTGACTTGATCAACTTTATGGACTTCCCAAAACATAACCAGGATCATAA 

CTGAAGAAACAGGCTCTTGCAGTTGACCCAAGTGATGAAATAAAGAGAGCCAGTGGGAGA 

TGTCCAAACTATTGAAAATTTCATCTGTGNCTTAATAGGTTTATC 

Sequence 1039 

TGANATTAGCATCACTTCGTCTACTMGAATCTTAATAGATGTAMAATATCTTTTAAAA 

CATATGGTAGGATGGGTAAAATTTGGCAATACTATCCAGGAAGTCACTAAGTACAGATGA 

ACTGATTTAGTCCTAATTCCAAGAAGTGTGATTCCACCTACTTGACTAGAAATTTATACC 
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■ TGGTAATAACTCCTTGTCCTTGMGATTTTCAACTAAGGAAAACTGTTTTTCAGCAGGAC 
CTGATTATGCA CTGCTA TCTAGGTAGGGTCACTTATGGTTTTATAATATATTTAATTGGA 
TTATAATATTCC I 1 1 I I I CTTGTCTCTTGGACAAAATCCTAGCTTTACTGTAATTTAAAA 
AGATGAGTT TAAM TTTCAGGCTTTAAAAACATACCAAACATTGATAAAAATGAAATCTA 
GATAAAAGTATTTTATCAATGTTCAGTTGCCTGGATTCAATAACTGTATTATGGTTATGT 
AAGGATAATATCTTAGGAAATCACATTATGGTATTAA 
Sequence 1040 

GTCGACCCCGCGTCCGGTAGCTTAGTTGAGTAGATAATCTTTtGTTGTTTCCTCCTTGTA 

ATATACAAGCCTTGGCTTCTGTGACATCATACTCTCCTAGATTTCCCCCTGTCACTGTGG 

CTTCTTCTCAGTCTCTGTCCATCCCTGGTGCTCCTGAAGGTTCTGTTCTCAGCCTTACAC 

ACATTACCTGGGTGATCTCATTCTCTGCCATGACTTCACTTGCCATATATGTGCTGATTT 

TCCCCAMTTCCTATTTCTCCCGACCTTTACATCTATTTTAT7TGCAGGTCATATATCTA 

ATAAGGAATTGATATCCAGTGTACATGTAGAACTCCTGTAATTCAATACAGAAACCAAAC 

AGTCCAATTAATAMTGGAGAAGAGATTTGAATGAACATTTTTCTAAAGAACATCTCAAG 

CTCMGATTTCCCAGATAACTTTTCTTTCTTCAAAATCTGCTTCTGTGTTTCCTCATCTG 

TAGGTGGCACAGCATACATCTGATTTCCCAAGCCAGAAACCTCATAGTTATTCTTGACTC 

CAGGAAGAAATATTATTGAGTTTTTAAAAACTC 

Sequence 1041 

cgaccccgcgtccgtgctgmctgagctcaggtgtgtttttcttccaagctttctagcaa 

ggtttcta cttaa aatcacctgtgtgcaagcccaaaggacatttcatctattctaagcag 

aaaggctgtntgttcattacagtgagtgctgttcatctcatggagtgggaggagcacta 

aaccaggagacagaggacatggatttggtttccagcttaaccagttaggactctgtcxjtc 

tgcattctggaaccatgatgcctgcctgcctgcctcacagggctgttgtgaggaccagat 

gagatgatgtatgttcatacttttggmtctctmtttamgtcttmtattttgtcttc 

tgagtgtgaggggataaacctggatgtagactattaagcagcataggagaaaagaacaat 

agaatctaatggactgggtttgcaatctctctctaaatgcactgcttcagacaaagtgaa 

atccaaaggtgtgaaaaagtatagctgcaaattggaaaaatgtgtttcaagagt 

Sequence 1042 

AGTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCX3GCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCTCCCTCCCGGGGGGTGC 

ACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTCC 

TGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCCG 

GCAGCACACAAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCGCCGTCGCGAGCGT 

CTCTGGGACCGGAAGTGCGGGCGAGCGCGGNTCCCCGGGTCTGACAGGAGCAAGCTGTGG 

GCACCGNGGCGGTAGTTGGAGGCGGNAGAGGGTNCGTAGCCGNGCCGNCTGCCCGNCATG 
GGCCTNC 

Sequence 1043 

AGTCGACCCCGCGTCCGCAGGGGCGTGTTGGCCCCGCACAGATTGAGCCGAGTTGTCGCC 

CCGCTGGGAGAAGTGACCCTCCTGCGCCTGAAAAGAAATTTCATTTCAATATAGGTGACT 

ATGCAGCCTGCAATTCAAGTATGGTTTGGAGAAGATCTGCCTCTAAGTCCTCGGAGTCCT 

CTGACTCCCAGACACGGACCAGGATTGGCTAATGTTTGTCAGTACGATGAGTGGATAGCT 

GTGAGGCATGAAGCCACTTTGTTGCCCATGCAAGAAGATCTGTCAATCTGGTTATCTGGT 

TTATTAGGTATTAAAGTTAAGGCAGAAAAATTATTGGMGAACTTGATAATGGAGTACTA 

TTATG TCAAC TGATTGATGTTCTTCAAAACATGGTGAAAACATGCAACTCTGAAGAATCA 

GGGMTTTTCCAATGAGAAAAGTGCCCTGTAAGAAAGATGCTGCATCAGGTTCATTCTTT 

GCTCGGGACAATACCCGCAAACTTCCTTCACTGGTGTAGGGACATTGGGGTTGATGAAAC 
TTA 

Sequence 1044 

ACGCGTCCGCCCACGCGTCCGGAGTCCTCCTCCCCGGGTGCCTGCCCGCAGCCCGCTCGG 
CCCAGAGGGTGGGCGCGGGGCTGCCTCTACCGGCTGGCGGCTGTAACTCAGCGACCTTGG 
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CCCGAAGGCTCTAGCAAGGACCCACCGACCCCAGCCGCGGCGGCCGGACTTTGCCCGGTG 

TGTGGGGCGGAGCGGACTGCGTGTCCGCGGACGGGCAGCGAAGATGTTAGCCTTCGCTGC 

AGGACCGTGGTGMGCCTCTGGGCTTCCTGMGCCCTTCTCCTTGATGAAGGCTTCCAGC 

CCGCTTCAAGGCACACCAGGATGCACTTGCCACGGNTTGCCGTGCCCCCTfSTTTCAGCAGT 

CCCT 

Sequence 1045 

GTCCGCAGAATTGACNAATTCAGGAGGTGTAAAAATAAACAGTGTTCTCTTCTCTACCCC 

AAAGCCACTACTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAAGGCATGC 

ATTAGCCACTACACAAGTCATTAGTGAAAGTGGTCTTTTATGCCTCCCAGCAGACAGACA 

TCAAGGATGAGTTAACCAGGAGACTACTCCTGTNACTGTGGAGCTCTGGAAGGCTTGGTG 

GGAGTGAATTTGCCCACACCTTACAATTGTGGCAGGATCCAGMGAGCCTGTCTTTTTAT 

ATCCATTCCTTGATGTCATTGGCCTTTNCCACCGATTTCATTACGGTGCCACGCANTCAT 

G 

Sequence 1046 

ACCACGCGTCCGCCCACGCGTCCGGGCGGGGGCATGGACTACTGACCCATGCGGGGCAGC 

GTCCCTGTGACCTGGCCGATGAGGAAGTACTGAGCCTGTTGGAGGAACTGGCCCGGAAAC 

AGGAGGACCTTCGGAACCAAAAAGAAGCTTCCCAGAGCCGGGGCCAGAGCCCCAAGCGCC 

CTCTAGCAGCAAACACAGAAGGAGCTCTGTGTGTCGTCTGAGCAGTCGCGAGAAGATTTC 

CCTCCAGGACTTGTCCAAGGAGCGCCGGCCTGGTGGGGCTGGGGGGCCCCCCATCCAGGA 

CGAGGATGAGGGGGAAGAAGGTCCCACCGAACCACCCCGTGCAGAACCCAGAACCCTCAA 

TGGCGTCTCCTCCCCGCCGCACCCCAGCCCTAAGAGTCCCGTGCAGCTTGAAGAGGCCCC 

CTTCTCCAGGCGCTTTGGCCTCCTGAAGACAGGGAGTTCTGGTGCCCTGGGTCCCCCTGA 

AAGGCGGACAGCGGAGGGAGCCCCTGGGGCTGGGCTTGCAACGCTCGGCTTTCTTCTCCT 

GGCTGGAAGGGACCTTCACTTANGCCMGGAGCTTCGTNTTGGCAGAATTACCCCGACCC 

CCTTCCCGAAGCTTGCCGGAGCCCTTNTGTCCTTGTCTGAGGGTCACCAAGGCTTCTTCC 

TTTGCTTGGGAGMCTTCCTTGNCTTNCCTTCAGGMTTCCNGAGCCTGGATTCCCAGCG 

AANCCNAACGTTCCCACAGGCTTTCACGGGGC 

Sequence 1047 

TGTCGACCCACGCGTCCGCTCCCCGCCGAGGCCTCCTGCACCACCTAGAGCCCCACCCCC 

GACCCCACCCCGGGAGGGCAGAGCCAGAAGAAGGCTCATTAGACCTGGGGGACCCAAAGG 

GTCTGGCCTCTTTGGGCAGCCCCAGAGATGAGGGGTCAGCAGAGGAGAGCTCTGGGGTTG 

GGGATGGGTTAGGGACGCAAGCTTGAGTTCTAGCCCTTGCTCTCATTCAGCTGTTGTGTG 

ACCCTGGGTMGACCCTTCCTTGTTTGACCCTCAGCTTTCCCATCTGTTTAATGGTGGCT 

TTGGCCAAGGCAATCCACAAACGTCAAAATTCCCCTTCCCATCAGTACACACACCGATGC 

ACACACACTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACAC 

ACACACACACACACTAGTTAGTGCCTTGGATGAGGCGGGGCAGTGTGTATATGGACCCCT 

GGACTTGCTACCTTCAGGGTTCCATACTCGTCCCTCCCCTCCTGGCTCTGCTGTCTGGAG 

TCTGGCAAGCGGG 

Sequence 1048 

CGCGTCCGCCACGCGTCCGCCGNCCGCCGCCTGCCTGGGCCGGQCCCGAGGATGCGGCGC 

AGCGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCTAACTTCCCCCGGTACGT 

CCCCGTTCGCCCGCCGGGCCGCCCCGTCTCQCCGCGCCCTACGGTCGGGTCCTCCAGGAG 

CGCCAGGCGCTGNCGCCGTGTGCCNTCCGCCGNTCGCCCGCGCGCCCGCGCTNCCCGCCT 

GCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCATGCTGCCCCTCTGCCTCG 

TGGCCGACCTGCTGNTGGCCGCCGGGCCCGGGCCGAGCCTGGG 

Sequence 1049 

NCCACGCGTCCGCAAGGCGCTACGTTTATTGCCTCGTCTTATTCACTGACCTTTGTAATG 

ATACACAGTGMTTCTTTTTGACAAAGAGAAATGCAGTGTAGTATGCAGAGCTGCTGTTT 

TAATGC CTATGCA TTTACTCTTTCCTGATTTAGGCAGAGGTGGCATTTTCTTTATTGCAT 

TTCTCTATTTTTTTAATGTACCCTACCTTCAGTATTCTCTTTGTMGTTGGTGACTTGCA 

TCTGTGGCCTTGAATATTTTATTATCACATGTGGCATAACAGTATCCACACTTTTTAGTT 
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CTTTAI I 1 1 I I I I I I I I I ATTTTGAGCAATTCTCCTGCCTCAGCCTCCCAAATAGCTGGG 

ATTACAGGGTGCATGCCACCACACCCAGCTAATT 

Sequence 1050 

CGCCCCGCGTCCGGGATGGACMGACAAATCTCTTGTAAAAAAMTTACAAGTMTTTTT 

ATAGAAGCTCTGCCCTGAGGGAGGGGGAGCGTGACTTCTCACTCCTTCAGTGTGGGCTGC 

ACAAGTGACTTCCTTCCATATGGGATCGTTATAACAAAAGACTGTAACAAGGGCTATGGG 

AGTTATAAGACAGGAATTGTGGACAAAAACCAGTGTATATCATAACATCACACCTTGTAA 

TGTTGGCAGTACAGTCACTGACCTTTGATAAATGTTGATGACATGTTGAGTAAAGGAATG 

AGAGAAAGAGGATTGTTTATCTCTGTTTTTATCCTTCTCAGAGAAACTTAGAGTAACAAG 

GTGTGTTATCAGCCATGCTGATGCCTTTGGTAAACTATTGTGTGANATNGGGTGGTTTGA 

ATTGGTCAAGTAGAAACTGGGGCTGCCAGGCGCAGCCGGTAAGCATTTNCATGTGAGCCT 

TAGGGAANAAGTGCATTTTGGTAGGAGCCATCAAAMTAGCTTCTTGATATTTCATTAAA 

AG 

Sequence 1051 

GACCACGCGTCCGGGGCTCCTGGGTGTGCCCGCGGCCTCTGGCGCGCAGCGACTCAGAGA 

ACGTCTACGAGGTCATCCAGGACTTGCACGTCCCGCCGCGGGAGGAGAGCGCAGAGCAGG 

TGGACGACCCACCGGAGCCCGTGTACGCGAACATAGAGAGGCAGCCCCGGGCCACTTCAC 

CGGGCGCCGCTGCAGCCCCCCTTCCCAGCCCGGTGTGGGAGACGCACACGGACGCGGGCA 

CCGGGCGCCCCTACTACTACAACCCAGACACGGGAGTTACCACCTGGGAGTCGCCCTTTG 

AGGCTGCCGAGGGTGCCGTCAGCCCAGCCACCTCCCCTGCCTCGGTGGACAGCCACGTGA 

GCCTTGAGACCGAGTGGGGCCAGTACTGGGATGAGGAGAGCCGCAGGGTGTTCTTCTACA 

ACCCGCTGACGGGCGAGACGGCCTGGGAGGACGAGGCCGAGAACGAGCCCGAGGAGGAGT 

TGGAGATGCAGCCGGGCCTGAAGCCCTGGCAGCCCAGGGGGACCCGNGGNCCC 

Sequence 1052 

CGGCTTTGCCGCMCATGCTCAATTCCCGATCATCGCTCAAAGTGCTAMTTTTCAGGAG 

TGAAAAGAAAAAGAGGAAGGAAGAAACCCCTCTCAGGCAATCATGTACAGCCACCCGAAA 

CAATGAAATGTAATACATTCATAAGACAAGTGAAAGAAGAGCATGGCAGACACACAGATG 

CAACTGTGAAAGTTCCTTTTCTTAAGAAATGCAAGGMGCAGGACTTCTTMTTACTTAC 

TTGAAGAMTATTAGACAAAGTTCATTCAATTCCAGAAAAACTCATGGATGAGACTACTT 

CAGAATCAG ApTAT GMGAAATCGGGAGTGCACTTTTTGACTGTAGATTGTTCGAAGACA 

CATTTGTAAATTTTCAAGCAGCAATAGAGAAAAAAATTCATGCATCTCAACAAAGGTGGC 

AGCAGTTGAAGGMGAGATTGAGCTACTTCAGGACTTAAMCAMCCTTGTGCTCTTTTC 

AAGAAAATAGGAGATCTTATGTCAAGTTCTACATCAATATCATCCCTGTCTTATTAGGGA 

TTACCGTTTCCTAAGCCAAGAGTCATGTCAAATTGCAATCAGGC 

Sequence 1053 

GACCCCGCGTCCGGGAGGTTGAACGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAG 

AAGAAGCCAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCA 

GTCACTGACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATC 

ATGGTAGATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGC 

TGTCAGGACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACA 

AAACTTCGGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGC 

CTCAGCTCTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAG 

CAGCCCTCTAAGGAGGAGGAGGAAGAGGAGGAGGAGGAACAACTGAACCAGACCTTGGCA 

GAAATGAAGGCCCAGGANGTGGCGGAATTGAAGAGGA 

Sequence 1054 

GTCGACCCCGCGTCCGCAAGGACCATGTTGTACCACAGCCTCTGCTGAGCTGAGGGACAC 

ATGTCCTTGGTGAAGACCTGCACCCCTGGAACCTCCCACCATCATCACAACTGNAGTCTC 

ATTTGCAGTGGAGAAAAGAACCCGACGTCCCACAGCCAGATATACACCCAGCTCCATGCC 

AGCCCTTCATGTTTACCTTTTGCTTTGTTAATTACATGTCAGACTCCTAGAGGGCCTCCA 

GACTAATAGGAAGCATTTCTGTAACCAACCTGCCACCCACTGATTCAGAAATGGAAATCA 

CATTCCACAATCTATGGCTTCCACCAGCTAGCCCAGGAAATACTTGAAATCAGCATTCCT 
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NAAAAAAAAAAAAAAAAAAAANAAAAANNNANAAAAANNAANAAANAAAANNAAAAANGG 

GGNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGTNCAAAT 

AAAGCAATAGCNTCCAAATTTCCAAATAAAGC A I 1 1 1 1 I I I CACTGNATTCTAGTTGGGG 

GGTTGG 

Sequence 1055 

CCACGCGTCCGCTGTTCCCGACAGCATGGATTAGCTTCCGTGTTCTGAAGTTGTTCTnT 

CAT GGTG TCTGACACCGAGGGCCGTTGTTCGTCCATCAGGCGGGATTGGATGGAGTCTTG 

GTGTTTTGCCTTCTCAGGGAC CAAA AATGTATCATTGACTCCTTAACAGTGACCTTCCTC 

CCAAGGACATATCCGTGTTCATTTTTCATAGGTTTTACTCATATTCATAGGTAGATTCTG 

TTAATGTGAGTTGGAAAGAAAAGACCAATTTGTACACCAGTCACACCACAAGACAGTTTA 

TCATATAAAATACCTCAATTTTTTGTATTCCTCATTTCCACCTCACAATTGTACTGGTGA 

TGAATTTTAAGGGTCTG 

Sequence 1056 

TCGCCNCGCGTCCGGGCAAGGAAGGCTCTGTTAATTTATAGCTGTTTAGAGGGGAAAGCA 

GTGCAGACCACTTATTAAGCCCGCTGAGGACTAGCTTTCTGTCTTTCATACATTTGGGAA 

AGATAGGAATGAC TGTTT CAAAGAAGAGAGGTGCACATAATTTATGCAGGCAAGTATGAT 

ACTTATTTCATTGGTTTTGTGAGTCATATATATACATATATATATATACACATATATATA 

TGACTTGAAAATTMGATTTMTACTTTMTGTTTTAAGTGTGGGGGGTTTGGGAAGGAA 

GGMTGTAATATNTGGGATTTAGCCTTAGGCTTTAAGTTTTAGGCTGGCAAAAGAAATGT 

TATTCAGTGGGTTTGAGGTTTGGACTACTTTCTTCAAACTTAGAGAATTATAACAGGATG 

GTGTTTACCTTTGTTCACCTGGGATGTCCCCAAGACTCTAGCTTCTTCTATCAAGTGGTT 

GGGTCTGATAGAAGAGGTAAAAATTGCTCTTGAAAATNNTCTACAATTATGCAGGTTCTT 

TGATAAAATTTTCTGGTTAAATCC 

Sequence 1057 

CCNCGCGTCCGCCNCGCGTCCGCCCAGGCCTGGGCCGGGCCGTGACGGCGGCGCTAGGAC 

CCGGCGGGCCGCGGGTGCGGCGAGGCCTGGGCGGCCTGAGGAGCGCGGACCCCGGCGCTC 

GGCTCCCGGCGCCATGTGAGGGGGCTCGGGGGCCGCGGGGGGCCGGGCGCTCCCCGCCGG 

AGGTGTGAACCCACATCCCTGCCCCCAGGGCCACCTGCAGGACGCCGACACCTACCCCTC 

AGCAGACGCCGGAGAGAAATGAGTAGCAACAAAGAGCAGCGGTCAGCANTGTTCGTGATC 

CTNTTTGCCCTCATCACCATCCTCATCCTNTACAGCTCCAACAGTGCCAATGAGGTNTTN 

CATTACGGNTNCCTGCGGGGCCGTANCCGCCGNCCTGTCAACCTCAAGAAGTGGAGCAT 
Sequence 1058 

TGGGGCCGGGAGGAAGTCTAACCTTTGGGAGACTCCAAGACCGCAGCTCCGAGGTCGGCG 

GGGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAG 

GAGGACGAGGAGAGGGACTAGGTTGGGCCCGAGGGGGCGCTTGGCAAAGAAGCCCCTTCC 

AGCTGACCGCCGAGGACNGTGTATNACATCTNCTACCTG 

Sequence 1059 

TGGCGATCGCTGAGAGGCACGGAGGGCCGAGGCGGNCCTGGGAGGCGGCCCGGAGGTGGG 

GCGCCGCTGGGGCCGGCCCGCACGGGCTTCATNTGAGGGCGCACGGNCCGCGACCGAGCG 

TGCGGACTGGCCTCCCAAGCGTGGGGCNGACAAGCTGNCGGAGCTGCAATGGGCCGCGGC 

TGGGGATTCTTGNTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAG 

GAGCAGCCCCCGGAGACAGNGGCACANAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGAT 

GATTGTACCTGTGATGTTTGAMCCATTGATAGATTTAATAACTACAGGCTTTTTCCCAA 

GACTACAAAAACTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGC 

CCGTGTCCTTTTTGGAATGACATCAGCCAGTGTGGAAGA 

Sequence 1060 

CCACGCGTCCGGGGTGTTGGTGCGGCGCTGTTGGGGTCCTCCGCTGGCTCATGGCGCCAG 
GCGTGGGAGGCCGAGTCCCCAGTGGCGAGCACTGGCCCGACTCGGCTGGGAGGACTGCCG 
GGACTCCAGAGTCCGCGAGGCGCTGCGGGCGCTGCACGCCGCCAGGGAAAACAAAGAAGA 
AGAGTTAATCGACAAACTGGAGGTGGTCACAATGCCTTCCCCATCACCAAAAGGACTGCC 



WO 01/070979 



PCT7US01/09126 



TABLE 1 

172/467 

AGTGAAGCAATATGCTGTGCAGTCTCAGCTTCCCGTATATGAGTGGCCGGATGTGGGATC 

TGGAGMTATGATGTTGGAGTAGTGGCTTCGTTTGGCCGACTTTTGAATGAGGCTCTTAT 

TCTTAAATTTCCCTATGGCATATTGAATGTTTCATCCCAGTTGCCTCCCGAGATGGCGTG 

GCCCAGCCCCTGTAATCCATACAGNTGCTTCACGGAGACACAGTTACNTGGAGTAACAAT 

TTATGCAAATTAGACCTAA 

Sequence 1061 

GCC GGTT CTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCCCATGTTCCTGTTGCT 

GCCTTTTG ATACG CCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCACCC 

TCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCAAAC 

TCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAGGAG 

CCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGGATC 

CCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGGTAGTAGTAAGGCTCTGG 

ACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGGAGA 

CCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTGGAGTCCTGGAACC 

TGCTGAGCAGAACCAATT 

Sequence 1062 

CCNCGCGTCCGCTTTGAATNCTTATCTTTGATTTMTTTACACGCCAGCATTTTG^ 

TTCTAAATMTATTTAGCTCAACTGATTCATACGTATTAATGACCATTCTAGCAAAGGCC 

TACAAGTGGTGTGGGAATCAGGGAAAGGCTGCCTCTTTGGTATCTCAACTGGTATTGATT 

ATTGCTATCAACTATTTGGGGAGAAAAMTCAAMTGMGCCCTGTCAMTTTTAGAAGT 

ACTATCTTTGGTCCTTCAMCACmGTGATGACACCTTMGAAAMTAAAGTTGAAGTT 

CAGGTCTTGCCATTGCCATTACAGACAMTTAGGAGACTTGGTTTACCTGGGAACAAATT 

T ACTTG AATATTCAGTACCTGAAACTATGCCAAACCAAAGAGCAGCTGCAGTACATTCGT 

TATTTTAAATGMCAAGGTTTACAMGNTTATTTTCATCTATACCGTMGGNTGGATTTT 
TTTTNAA 

Sequence 1063 

GTCACCACGCGTCCGCCNCGCGTCCGGCGTGCATGGAGGAACGCTGGGCACGGGCCCGGC 

GCGGGTCGGGGGGCGCCCGAGGGGCCCGGGCCGAGCGGCGGCGCGCAGGGCGGCAGCATC 

CACTCGGGCCGCATCGCCGCGGTGCACAACGTGCCGCTGAGCGTGCTCATCCGGCCGCTG 

CCGTCCGTGTTGGACCCCGCCAAGGTGCAGAGCCTCGTGGACACGATCCGGGAGGACCCA 

NACAGCGTGCCCCCCATCGATGTCCTCTGGATCAAAGGGGCCCAGGGAGGTGACTACTTC 

TACTCCTTTGGGGGCTTGCCACCGTTACGCGGNCTTACCANAACTGCAGGCGAGAAGACC 

ATTCCCCGCCAAA 

Sequence 1064 

GTGCCACGCGTCCGCCACGCGTCCGCCTGCCCCTCGCCGCCCGCCGCCTGCCTGGGCCGG 

GCCGAGGATGCGGCGCAGCAGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCT 

AACTTCCCCCGCTACGTCCCCGTTCGCCCGCCGGGCCGCCCCGTCTCCCCGCGCCCTCCG 

GGTCGGGTCCTCCAGGAGCGCCAGGCGCTGCCGCCGTGTGCCCTCCGCCGCTCGCCCGCG 

CGCCCGCGCTCCCCGCCTGCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCA 

TGCTGCCCCTCTGCCTCGTGGCCGCCCTGCTGCTGGCCGCCGGGCCCGGGCCGAGCCTGG 

GCGACNAAGCCATCCACTGCCCGCCCTGCTCCGAAGAGAAGCTGGCGCGCTGCCGCCCCC 

CCGTGGGCTGCAAGGAGCTGGTGCGAGAGCCGGGCTGCGGCTGTTGCGCCACTTGCGCCC 
TGGGCTTGGGGA 

Sequence 1065 

CGCGTCCGAACGGCATCATCACGCCCGCCACCATCCCCAGCCTGGGCCCCTGGGGAGTCC 

TGCACTCAAACCCTATGGACTACGCCTGGGGGGCCAACGGCCTGGATGCCATCATCACAC 

AGCTCCTCAATCAGTTTGAAAACACAGGCCCCCCACCGGCAGATAAAGAGAAAATCCAGG 

CCCTCCCCACCGTCCCCGTCACTGAGGAGCACGTAGGCTCCGGGCTCGAGTGCCCTGTGT 

GCAAGGACGACTACGCGCTGGGTGAGCGTGTGCGGCAGCTGCCCTGCAACCACCTGTTCC 

ACGACGGCTGCATCGTGCCCTGGCTGGAGCAGCACGACAGCTGCCCCGTCTGCCGAAAAA 

GCCTNACGGGACAAGAACACGGCCACGAACCCCCCTGGCCTCACTGGGGTGAGCTTCTTC 
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TTCTTGTCGTCATCGTCCTTCTTCAAGCTTGGCCAGCAACGAGAACGCCACAAGGAAACT 

NGTGAGCCCACGTTNGGCCGTCGGGAAAACACGGGGN 

Sequence 1066 

CGCGTCCGGCCTCCAGCACATCCTGCCTGCAGAGGGTCTGGCTAGCTGCCTTTTCAGCTC 

TCGAGGGATAGAGATTCTACMCCTCCCTCTGTCATCAGTTCCAGAGCCACTCCCCTTTG 

CACTAGAAGTTCTTGCTTTCAAAGAATGAGGGTGTGAGGGAGGGAGGGGTCAAGAAACAG 

AGTGACAGGGGAAACAGGCAGAACAAGTCAGGGCAAAGGACCCCAGCATGAATAGTTGTG 

GAGGTGGAGGTGGGGAAGCAGCCTCACATCTCACACTTCCTTCCTTCTCTTAAATGTGAG 

CAGCTGACTCCAAGCCTTGTGGAAACTCTAGAAGGTAGAACCAGCCATCTGGGGAAGCTG 

GCCTTACAGATGCCCCGTCTGGCATAGTGGGAGGTTCCTGTGCTCTGAGAACCCCAGTGT 

GAATCTAGACATTCCACTGCAGCCTGGGMGMGCCTGTGTTTTTCTTTAAAAAGTCT 

Sequence 1067 • 

GCGTCC GGTTC TTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCACCATGTTCCTGT 

TGCTGCCTTTTG ATAG CCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCA 

CCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCA 

AACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAG 

GAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGG 

ATCCCACTTCACTAGAAGANGAGATCAAAGAGATTNGTCGAAGGGGNCNNAGTAAGGCTC 

TGGACMCACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGG 
A 

Sequence 1068 

TCGACCCCGCGTCCGGCTGGTTTTCCGTCTGGTGAGGGGTTACTTCCGGGTCGGACGGCG 

CTAGCTGCAGCATCGGAGTGTGGCAGTGCTGGGCTGGCCGGCGGGCTGGGCTGCGGCCCG 

CGCGCGGCCGGCGATGCANGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGA 

CCGGGGCTGGGGCTGTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACC 

CCGAATATGACGA ATCTG ATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAAC 

AAGCCMCCTTCCCTTTTGCMGTTTGCAMCCAACTCTTGCTGGGTTTCTTTGCTGGAA 

GCACNTTGAGCCCACTGTGCATGAACCA 

Sequence 1069 

CCGTCCGGGAGGTTGMGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAGAAGAAGC 

CAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCAGTCACTG 

ACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATCATGGTAG 

ATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGCTGTCAGG 

ACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACAAAACTTC 

GGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGCCTCAGCT 

CTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAGCAGCCCT 

CTAAGGAGGAGGAGGGAAGAGGAGGAGGGAGGAACAACTGAACCAGACCTTGGCAGAAAT 

GAAGGCCCAGGAGGTGGCGGAATTGAAGAGGAAAGAAAAAGAAG 

Sequence 1070 

GCGTCCGGTGCTGGAGGAAMTGTTTCTGGGGAAGATGACTCAGTCATTTTGTGGCGAGA 

CACCCTTTGG TAACTC CCACTGACCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTG 

AGCGGAGATGTTTTTTGCAAAATGAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAAC 

CAAGAGAAATCCAAAGACCTGGGGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAGGG 

AAGGGGAAATGAAGTGAAMTGGCGTGAGGGTGTGAGAGAGGTTTGGGTTAGGAAACATG 

TTTTTAGTGCTATTTNCAACCAGGG 

Sequence 1071 

CACGCGTCCGGGACTGATCTCNAGGACCAGCACTCTTCTCCCAGCCCTTAGGGTCCTGCT 

CGGCCAAGGCCTTCCCTGCCATGCGACCTGTCAGTGTCTGGCAGTGGAGCCCCTGGGGGC 

TGCTGCTGTGCCTGCTGTGCAGTTCGTGCTTGGGGTCTCCGTCCCCTTCCACGGGCCCTG 

AGAAGAAGGCCGGGAGCCAGGGGCTTCGGTTCCGGCTGGCTGGCTTCCCAGGAAGCCCTA 

CGAGGGCCGCGTGGAGATACAGCGAGCTGGTGAATGGGGCACCATCTGCGATGATGACTT 
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CACGCTGCAGGCTGCCCACATCCTCTGCCGGGAGCTGGGCTTCACAGAGGCCACAGGCTG 
GACCCACAGTGCCAAATATGGCCCTGGAACAGGCCGCATCTGGCTGGACAACTTGAGCTG 
CAGTGGGACCGAGCAGAGTGTGACTGAATGTGCCTCCCGGGGC 
Sequence 1072 

CCCGCGTCCGCGCGACGGCCGCGGCGCGGGACCTTAGGACCCGCGGGCTCCAGGGCTACT 

GTCCGTCCGCCACTGCGCGCCAGCAGGTCCTGGTCTCCGCTCTCCAACAGCTGAAAGGCC 

GGCGCAGTGAACACAGAAACGAAAACCAAGAAATGCCTTATTCCACAAACAAAGAGTTGA 

TACTTGGCATCATGGTGGGCACTGCTGGAATCAGCTTGCTGCTCTTGTGGTACCACAAGG 

TCCGTAMCCAGGGATAGCMTGMGTTACCTGMTTTCTTTCTCTGGGTAATACATTTA 

ATTCMTMCTTTGCMGATGAAATACATGATGACCMGGMCMCAGTMTCTTTCAAG 

AAAGGCAACTTCAGATACTGGAGAAGTTAAACGAATTCTGACAAATATGGAAGAACTCAA 

AGAGGAMTCAGATTTCTTAAAGAAGCTATTCCAAAGCTGGAGGAATATTTACAAGGATG 

AACTTGGGAGG 

Sequence 1073 

CGCGTCCGCTGAGTTCNAGGATGGTTTTTTCTTGGGACCAGACATGAACAAAAGTTGACC 

TCATGAGCACTTCAACCTCTCCAGCTGCCATGCTCCTCCGGAGGCTGCGGCGACTCTCCT 

GGGGCAGCACTGCTGTCCAGCTCTTCATCCTAACAGTGGTGACAGTTTGGCCTGCTGGCC 

CCCCTGGCCTGTCACCGACTTCTACACTCTTACTTCTATCTGCGCCATTGGCATCTGAAC 

CAAATGAGCCAAGAGTTCCTGCAGCAAAGCTTGAAAGAGGGTGAGGCTGCCCTCCACTAT 

TTTGAGGAGCTTCCCTCTGCCAATGGCTCAGTGCCCATTGTCTGGCAGGCCACCCCCCGG 

CCCTGGCTGGTGATCACCATCATCACTGTGGACAGGCAGCCTG 

Sequence 1074 

CGTCCGTGAAAATCCAAAGATGTATCATTTTTATTTGAATCCATCATGCAGTGTACATTT 

CAGATMTTTCCTTCAGTCTCCAGATAGGAGTGTATCCAAACATCTAATTTTATGTGCAC 

TGTGTATCTTATATGAATGTTTTATTTTATATACCACATGCAAAAATGNCCATATGCACT 

AT7TAAATGTTTTAMTAATATATTCCTTCTTTATAATGCTAAATCTATATGAGTACCAT 

ATTTTTATAAGTCAGTGGTCTGACNGGN^ 

GGTATTCAANGNGAAAAGGGTGGNTGTGAGGAGAANATGTGAAAGNGGNNTGGGNGNNCT 

CTTTTGCCTTTGGGCCAGGAATTNGGGGGGCNAAAATNNACCCANAACCTGGNNNAAAAN 

TAGGNCCANTTGGGGGNGANAGGT7TCACTTTGGGGCNCAAAAANAAAANCCCGGGGTTT 

TTTNTNTTTNCCAAAATANATTNTTTTTGGGA I I 1 1 1 I I I I GTNCCCCCCCCGNATTAAA 

TGGGGANTTGGCTGGNGTCTTGGCNCCTNTCATTTGTGCCAGACC I MINI I ATTAAAA 

AAGAACCTTGGGAAAGGTCTTAAGTNCATTTGGGAAAAAAAAAAAAAAAAAA 

Sequence 1075 

GAGCCGNCCCACGCGTCCNCGGNCGCGTGGGCTACCTTGGAAGCAGTCATCTCTCAGTCT 

TACATTTGGAGAATGTGGATGGCATGACATCAGAATTCCTTTATATMTTTAACTTCAGA 

ATAGTCTGAGATCATCGAAGCACGATGGTCMGGGAATTCCGTTTTTGTTTTAGAGCAAA 

TATGTTTGCTGTTTGTCTTTCATCACAAACATCAGTGGAGTTTCAGCACCTTACAGAGCT 

CAGTGAACCCCCTGGTCACCATCAAAGTTAGCACACAACAAAGCCAACCACGTGTCCCCC 

TCACAGATGACAATGGCTGAACTCTGAGTGAAACCACCTGTATGGCCGGGCACAGTGGCT 

CA 

Sequence 1076 

GCCCCGCGTCCGCTTTTTGAGAATCTCTGCTCTGTTCCTAGGTTCAGTGCTGGGTCCTGG 

GAATACAGCAGGACAGACCTCAGCTTATCTCTTCATAGAAATTATACAAAGAGAATTGGG 

GAGACAGCTAAGAAGAAAACAAAGAMTAMGCAGTTACAMTTGTGATAAAGTGCTTTT 

GAAGGAAAGAAGGGGTCTGAGACAACAACAGGGAAGGGGCCTCTCTTGAAACAGTAGTTG 

GGAAGGAGGCAGACATGCACCAGTGATGTGGTGACAGGTGCTCTGAAGGAGGTCACCAGG 

ACCTGACCTCTTTGAAGGATCAGAAAATACTTCCCTGAAGGACTGACATTTGAGCCTAGA 

CCTGAAGGGTGAGCCATCAAGCTAAGACAATTGGGGAAGAGCATTCCANGGAGAGGGAG 

Sequence 1077 

CGCGTCCGATCTTTGTCTGCTTTCCTATAACTCAGTACTGTAACTCAGTACTCTGAAATA 
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GTTTCCTTTGTTMTAGAGTCACTTTTATAGTACTGNGCTTGAGGNNATATACAGAGTAT 

TGTGTCCMATTTATCATTGCACAMGTGTTTTTGGAAATTCTTGGTTACTCCTTAGTAA 

ATTACCTGTMTTGGGTTAMTGCTGGTAGGGTTTAAAATCTGATTGCTAAAGTGAATTC 

TCTA TAM GTGAGTTTTGATACATAGAMCTTTNCATATAATTCTTAAACTCATGTGTCA 

TGTATTTTCATTTATAGTTTTTCATATTCATTMCATATGTTGTTCCTTACCAT^ 

CTCANMTTCTGCANATGCAGATTTTTGCAAACTTTGATGCATTTGGACAGTCTAGTGGT 

TCGAGTAATTTTGGAGGTTT 

Sequence 1078 

TNCGGGCGGCTGCGGCGGGCGGGCAGGCGGGCAGGCCGGCAGGCGGGTGCGCGGAGGGCT 

GGTACCCCTNAGCAGGTGGGCGGGGTGCGGTTGGNGGCGGCGGCTGGGCCGGGGGCTGCC 

CGGCTGCGCTCGGGCCGTGCGCGGNGGCCGTGCGGGCACGCCATGGACTTCAACATGAAG 

AAGCTGGCCGTCGGACNGCGGGCATCTNTNTTCACCCGGGCCGGTGCANTTCACGGAGGA 

GMATTTGGCCAGGCTGAGMGACTTGAGCTTGATGCCCACTTTGAAAACCCTTCTGGCC 

CGGGCAGACAGCACCAAGAACTGGACAAGAAGAAGATCTTGAGGCAAGACAAGAGGTGCC 

TGCTGCAGCCCCAACCCCAGTGCCCCGAGTGGAGGGAATTANCTGTATGAAGAAGCTTG 

Sequence 1079 

CACGCGTCCGTGCTGAGCGGTCCCGAGGGGAGGGGCCTGAGGCCGAGTTCCAGTCGCTGA 

CCCCTAGCCAGATCAAGTCCATGGAGAAGGGGGAAAAGGTCTTGCCTCCCTGCTACCGGC 

AGGAACCTGCCCCGAAGGACAGGGAGGCCAAGGTGGAAAGGCCCAGCACCCTCCGTCAGG 

AGCAGCGTCCTCTTCCCAACGTGAGCACCGAACGTGAGAGACCCCAGCCTGTCCAGGCCT 

TCAGCAGTGCACTGCACGAGGCTGCCCCCTCCCAGCTCGAGGGGAAGCTGCCATCTCCTG 

ATGTCAGGCAGGACGATGGGGMGACACCCTGTTCTCGGAACCCAAGTTTGCACAGGTCA 

AGCTCMGTAATGTCGTCTTGAAGACGGGATTTGATTTTCTGGACAATTGGTAAAATGTA 

TTAGAAAAATACAATGAAAGAACCCTAAAATGTTTTCCAAAATGGTGTGGTGGAGGAGGA 

TAAAAMGGGCCACCTTTTCCTATGTAT^ 

ATTTGGAAACTGGTCAATACTGNCAGA I I M I I 

Sequence 1080 

GTAAGCCAGGTGCTCCCCCTTCACTTCCTGTGTGCGGAGCACGCTCGCCCTGGGAGTTTC 

ACTAGAAAGAAGGTTGCCATGGGCCAGTGGGACAGCTTGGATCTCAAGTGCACGCGGATG 

CCCCAGMTCCCAGGATCTCAGCTGAGCTGTTTGTGGATTATTAGATCTGACTTAAAAGA 

ATATTATCCAGCAATGCAAATGAACAAACTATAACTACACACAGCTGCATGGATAAATGT 

CAGAAACATGACGTTGAAGTGTGAGAAGCCAGATGCAAACCGAGGACTCACTGTGCAATT 

CTGTGCATGTACAGTGGCCAGGAGMGGGAGCACTGGCTTTTGCTTTCATCAGGCCAAAG 

ATGCCTTTCTTTGGGAATACGTTCAAGTCCCAAGAAAGACACCTCCTCGGAAGGTCGGCA 

TCTTTCTCCMCCTGCATTCTTTGGATCGATCAACCCGGGAGGTGGAGCTGGGCTTTGAA 

TACCGATCCCCCGACTATGAACCTGGCAGGGCAAAAGCCTGAAAGTTTGAAAAT 

Sequence 1081 

TGCTCTCTCACCTGTTAGCTGTGTAGCATTGGGCAGGTTACTTAACCTGTCTGTGCTTCA 

TCTGTGAMCAGGMTAACAGCATTATTAAGGATTGTTTTAGGATTGGATGGGTTMTAG 

ATGTAAAGTCTTAGAACTATATTGAGCATCCCATCAAGGCATTGTATTATATTGAAACAA 

TGGGGTTTNTTTCTCTTTATNCTTTTTTMACTA^ 

ATTNCTAA 

Sequence 1082 

CCCGCGTCCGGTGAATGTTAGTATTGGGTGTGGGATGCATCAGGGACACAGGTTTGTAAA 
CCATGACAATTCAATTGTAAAACTAAAGCCNAGTGCCCCCTGTAGTCCCAACTGCTGGGG 
AGGGTCACTTGANCCCTAGGGGGGGAAMTGCAGTGGACTGGATTTTGNGCCCTTTGCAC 
TNCAAACTGGGTGACATAGTGAGGCCCTTGTCTTCTACCANAAAAAAAAAANNANNANNN 
NNANGGTGCCGGGCCGCTTAGAACTAGGTCTTAGAAGAAAAAAACCCTCCCACAACCTTC 
CCCCTG AACCCTG G AAAAC ATN AAAATG G AATG CCAATTNGTTTG GTTGGTTAACCTTTG 
GTTTTATT TGCA AGCTTTATAAATNGGGTTTACCAAAATTMAGGCCMTTANGCCATTC 
ACCAAAAATTTTCACAAAAATAAAAGGCCA I I I I I I I I I I CACCTGGCATTTCTTAGTTT 
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N^GGGNTTTTGGTCCCAAMCCTCATCNAATGGGTANTCTTAATCATGTCCTNGGGATC 
CCCCCCGGGTTACCCGGAGCCTTCGGAMTTAMTTTCCTTCTTTCCGCTTTTCCTTCGC 
TTCACTTGACCTCCGCTTNGNCTCGGGGTCCGTTTCCGGGCTTG 
Sequence 1083 

TCGACCNCGCGTCCGTGGAGGGCCCATCTGCCAGAGCCTGGAGTCTGCGAAGGCCGGGAC 

CCGGTTCCCCGGCCCACAGTGGGGGTGTGCAAACCCGAGAGAACTGGGAAGTGCCGTCAG 

AAGCGATAACTGACGACGTCTAATGTCTATCTGACCGCAGTCGCTGAAACCTCTACAACT 

TAGTTGACCGTAACTGCCAGAGCCCTGCCCTGAATTCCTGTCCTTACTCCCTCTTTAAGA 

TTGCGTACCCACTGCAGAGTGCTGAAGACGGGGTAGCCACCGAGGTTGCAAATTCGTGAA 

GMTCAGCATCATGTTTGGCAGCTGAGTATTGGAGCCAGGAGCCTGCCATGAGGTTTTGA 

GAACAGAGTGCTGTTTTAGAGCTGGCAGCAGCATCTCAGCCCAAGAGAAGGTTATATTCC 

CAGAGGATGTCAGTCCCAAGGACCAAGTAGCTGCCATCAGTTTGGATTCTGAAAACTAAC 

TGGCATCAAACTGGGTGTAGAAACATG 

Sequence 1084 

CGCGTCCGGACTGTCGCTCTAAAAGAATGAAGGAAATAATAAAGTGATAGACAGGGAAGG 

ATAGAAMGACTTMCMTATACATATGTTCCGTCTTTGCTGTTTTGGAGAATGATGGAT 

AAGTANGTGTTTCCTGATTCTGAAGCATAGCTGAACAATTTMTTGTGGTTTACCATCTT 

TTTGGTTCCCTCTTCAGTAATTAACCTATCGAAAATCTGTCCTAAATGTTTGGACTGGGG 

CACAGTTCCCTCCATCGCTTTGGGAGAAAATCATTAATATGGCATACTGCAGATTGGAGG 

GCAGGACCACTGAGGGTGTCATAGACATTAGCTCTATGGAATTCTGCTAGCAATTTCCAA 

GTGACAGTGAGGAATTATGGATATATGTTTGAGGTCATTCAAGCTTCCTGAGTACCACAT 

TCCCCAGCTACTTAGACACCGGGTTAAAATATTAAAGATGTCCTAGTTCAACAGCTTGAA 

TTCCATTGATTGGAT 

Sequence 1085 

GACCCCGCGTCCGGCTTCCTGGGTTCGAAAGMCCCAGTTCAGGAGTTTCTGTTTTAGTT 

TGAGATCTTATAGGCCTGTCTCATCAGGTTGGTGTCAGCCCAGCTAGGATTAGGCAGAAT 

TGGGTGGGGGCTGTAGTGCATCTTTGGCACAGCATGTACCTGTCTGACTAATTCTCTGTC 

TTTTCTTTCCTGTTGCMTTCATGGGTCTTAGCATCTTCTGMTGGTGTTTAGTAGGT 

TCCTGTTGATTTCCTGCTAGGGAGTAGCATACTCTGGCTCTGTACCATTGGCCAAGGGAC 

TTAAGGATAGGTGAAGGGCTGCAGTTTTGTTAAATGGAACAATATGAAGAGATGGCATTG 

TAAAAAAACTTNTGNCAACTNAA 

Sequence 1086 

T GTCGC CCCGCGTCCGATCAMTCTTGATGMGGATTGTAGATTTTTGCTTTT^ 

GTTTTTAAAACTTATTCCMTTGCTAMTTGGTAGTTTTTCAGTCTTTATAM 

TTAAAAATATATATACAGTTATATGAAATGTTTATTTTCTATGTGTGTGCATATAGTTCA 

ATATTATGCAA TAAAT TTGGTGTTTTAACTTAAAACTATTTCTTATTGTACTTGCAGAAT 

GGATAGCTTGCTTTTAGTAGAAGCATTAGGTCGTATACTCAGATAATCTAATAGAAGGTC 

AGATTTGATTCCTGCATMGAAAGTAGAGCCCAAGTGCTGCAGAAATGGAGAAGAAAGCA 

GGGGCMGGGAGCAGATGGCATTAAGGAAGAATGAAGTTTTTGAAGGTTGGGGATGGACG 

AAMGGGTGTTTCTCATGGAGAGGGGATGCTTTAGCAAAGGCTCAAACATTGGGGCATAT 

TTAGGCAAGAGCCAAGAAACAGTTTGAAGGGGAACATCAGAGGAAATAGGCCAAATTAAT 

AGTAAATN 

Sequence 1087 

GGNGTCGACCNCGCGTCCGGAAATACTCAAATAATGCAACATTACTTCCCAGAAATGAAA 

ATACATTGCATCTCTTATTGAAAGAGCCAACAGAGGCAATGAAAAATAAATGTACACATA 

TGTGCACCCTTMTTCTTTATAATGAACCTTGAGAAATCAAGGCTTTAGAAAAATGCTGA 

GAGACAAGAAGCTTCTGAGCAGCAACACTGGATGCAGGAAAAAAAATGGAGTAATAAGTG 

TAMGTTCTGAGGGGAAAAAAAATATCAGAACCTAGAATTCTAAATTTAACCAAACTGTC 

ATTAMGTAAATGAGAAMGTAAATTCTTTTGAGAAACGCAGAGCCTAGAAAGGATTACT 

AATCGGTAGGATCATCTTTTTTTGTTTGTTTGTTTGGNTTGGGGACAGACTCTCACCTC^ 

GTCACCCAGGCCTAGGTTTGCAAGTGAGCCCAGGATTGTGCCACTGCCCTCCAGCCTTGG 
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GGGTGACAGGAGCAAGGACTTCATCTCAAAAAAAAAAAA 
Sequence 1088 

TCCCCCTGCCCCCCCAAGCTGAGGGAATAAACTCTGCCTGTCGGGTGCCCCACCAAGGGA 

AATAAACATGGCCTAGCTGCCAAGTCATGCCTGTAGGGTGCTCTCTACTGGCAGTTTCTG 

GGTGGTGCATACATCTAGTCTCCCTAGAAGAGCACAGTCCAGAATTGAGGAGGTACAGCA 

GAAACAGACTGCAGGCAGAGAGAGGTCTCATAGAGCTTGGACAGGGCTAGGCACAGAAAG 

AACAGGCAGCGTATCCAGAAGGGGGCAGGGAATGGGTGAAGAGGTTGTGCTCTAGGGCAG 

AGCTGAGCTCTGATCTAGAAAGGACAGCAAAGATACCTGGAAGGCCTCCCGATTCTTGCG 

TTGTTGGCGTCGCTCCCGAAGCCGGG 

Sequence 1089 

CGNTCCTTGGGTAAAAGCGTCCCAGAGACGGGAAGAAATATGGTATAAGCGAGAAGGCCT 

CATAAATCTGGGCTGTTAAAAATCTAAGTTAAAAATATGTTTTAAGTCAGAAAAAAAAAA 

AAAAAAAAAAA 

Sequence 1 090 

CGCGCCTGGTAAAATTATATAAGCTTAAAAAAACAAAAACAAAAACACTTGCTTTGAAAA 

GAGTCTCTCAGCAGCAATTTTGTCCTTGCCCCTACTTCCACAGTTCCTTTTCTTACCATT 

TCACATCTG GATT ACTACATTGGCCTCTTTGCTTAGACTCCCAATATTCATTTGCTTTCC 

TTCACCCCATTTTATGGAGGACTGTCAGATCMTCTmAAAGATAAATTTTATAATGTT 

ACTACTGTTGC CTATT GGATTAGAGCCCTAGGGGTGCTTTTTGTAGTCTCACTGACAGCT 

GACATTAGTGATTTTTCACCCTCTTCTTATTGCTACCCTGTGTTGATGGCCAGTTTCCAG 

GTGGGCACCTGCTCCACTTGCTTTCAT 

Sequence 1 091 

GGGGTATGTGTGGTTCTTCCAGGAAAGTGCTGAAAATATCACCCAGGCCTCTGCGCCACG 

CCCTGGGAGAGTACACTCCTGGGCTCACGCCTCTGCATTCCAAGGCTGACAGCTAGAAAT 

ATACTTTGTAAAATACCAACAACTTATTCACAAATATTCCAACTATCTACCAGCTCCAAT 

GAGCTTGCTGAGGATGGGTATGACCCCAGTCTAAGGGGAAAGAATCTAAAACACAAGTAA 

ACCTGTTTAAAGGCCAGATCTCCAGATGGAGATCCAAGCAGATGGCGCCTAAGGTTTGCC 

CTTGAAAACTACCAAGGAAGCCACAGAGAGGGATCTTTGGACCTTCTGGAAAATGGTAAG 

GCCCCAGGTAGATTATGGCTCCTCTGCCCTGGAGGCTGAGCCGCCCTCTGGTTACCTCAC 

ATCTTCTGGTTTCTTCTGAGTGGGACTTGATCTCATTTCTGCATTCACAGCAAGGNGGAA 

CTGTCTGGCAAGAGCTTAAAATTAGGACCTGNTGGTGGGGACCTTTAATAGCAGGTGGAG 

GGTTTGAGATCCCNTGAGATGCCCNAGATTAATTCAATAGGGGGANGAAAGATTTGGCCC 

AATTCAAAAGNGCTTAAAAAAAAATTTT 

Sequence 1092 

CGCGTCCGGTTCTTTGGTTGAGCTTCTTTGTATCAGTAACAAAAGGAAGCATCATTCAGT 

GTTTCTTTGTMCTTMTGTMGTCTCTTTTGTACATCCTATTACTTCAAATCATTGAAG 

TGAACTCC ATTTT CACATCTGTTGGGAACAATCATCTAGCTTCTTAAATGACTCATCTTA 

AAATATGAATTTTAGACTGCCTAAACATTCTGAGGGAGTACAGTGTGATATAGCAGAAAC 

AACCGGGGCTTAAGAAGGATCAAAATGAAAGGTTTTGTGAAGGATGTGCCAGAGACTGCT 

CTCTGTTCACCAAAACCACATTCTTCTCTTCTTCCTGGCATAGAGCCAGACTAGATTTCC 

AAGCTTTCCTTGCAATTGAGCTCTCAGAGTTCTCGTCATTAGAATGTGAATGATAGGCCG 

GGCGCAGTGGCTCATGCCTGAGATCCCAACACTTTNAGAGGCCAAGGTGGGTGGATCACT 

TGAGGTCCAGAGTTCNAGACCAGTCTGGCCAACGTGGTGAA 

Sequence 1093 

CGCCCC GCGTCC GATAAMCTGGATTTGATTTCTTTTTATGAAANGTTTCATATGAATGT 

AACTTGA I I I I 1 1 ACTATTATAATCTAGATAATATGATATAAGAGGGCTAAGAATTTTTA 

MTTGMTCATATATATGATATAATTTGATCCTTCTTGTATCTTGAAGTTTTGTACTTGG 

GATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAAATATTTTCTTGGAGCTCT 

GTGTCAACTTTGATCCTTTGTCTCCCAGGAAGGTGTGACCTCTCCTTTGCCTGCATACCT 

CAAGGCCAGGGGAATATGCCTCAGTGATGCATTTATCTTTGTATATCAGGCCGCATGATT 

CCCMCTTTCTGCCACACTTAMTTACGTTCCTCCATTTCAGTTTTGTCTTTTCTGTCTA 
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AAGT TCAGTCAAAGAGTATCAAAAAATTATGTTTCAGCTAGACTGGTGTAATGTATAAGT 

TTTTGTATCTTGTATTAGAGGATTTCGTAGCTTTTATTAGAGG 

Sequence 1094 

GCCCCGCGTCCGATCCCTAGATGACATAACAGCCTTACAAAAGGACAGGGAGGAGTGTCT 

GTTCCTACTCTCACATAGCGGAGGAAAGTTAGAGCCTCTCAGTCTCTGTTTATGAGGACT 

CATTMTCTCAAATAATTGATGCATTTTTCATACATTAGGGTCTCTGTCCATGTGTCTTC 

CTGATATTGTTATAGAMTGGCTTCAGGCTGCTGGTAACAGATGCTGCGGAAAAAGAATG 

CCTTAAACAAAGCCAGGCGCGGTGACTCACGCCTGTGATCCCAGCACTTTGGGAGGCTGG 

GGTGGGAAGGATCACTTGAGCCTAGGAGTTAGACACCTACCCAGCCTGGGCAACACGGTG 

AGACCTCGTCTCTACAAGAAACAAATAATTGGCTAGATGTCGTGGCGCACAAGCTTGTGG 

TCTCGGCTACTTAGGGGGCTGAGGCGGGAGGATTATTTGAGCCTGGGGAGGTCAAAACTG 

CGGTGGGCTGGGATTGCGCTACTGCACTCCGGGCTGGGAGACCGAGTANGACCCTGCCTT 

AAAAAAAAAAAAAAAAAAAA 

Sequence 1095 

AGTCGCCCCGCGTCCGCTCATACCCAGTGAATCTTCAACAGAATCTCTTAAAGATCTCCA 

GGAAGTATMGCTCTCATTMTGTTTGAGTTAGAAGAACTTATTCTGGGCCTTTAATTTG 

TTGCATGTGCTGTACTTAMGCATCCCAGATAATTTTAGCTTATATTTTCATAGTGTTTA 

TACAGAGCTTGAATTGGMTGGTCCTTTCCTTCCTTGCCTCAGTACTTCCTTCCATAATC 

TTTCCTGCCATAACCATTTATTTTGCCACCATTTCTTAAACACTTATGTGGCAGGCATrA 

TGCTAGACTGTAATATG I 1 1 1 1 1 I I AATCCCAGTTGAAGTGGATGTGGGAAGGTATTAGA 

AAGTAGAAGAAAGTATAGTCTAAAATAGAGAGGAAAGAAAGGAAGAGAAAAGTGGGATAT 

TTCAAAACCATTTGCGCAGAGGTAGAATGAAATTCGCCAGAATGGGAATCTCCGTATTTC 
TTTACAAT 

Sequence 1096 

GTNGCCCCGCGTCCGAGTNAACAGTGGTAGTNAAATTCAGGGTTGGTAAGTTTTT(XATA 

GAAGGCCAGATGGTAAATATTGTAGGCTTGCAGACCATGTGGGCTCCACGACTCAACTCT 

GCCACAGTAGTTTGAAAGCAGCCACAAACAGCCTTGGTGTGACTTTGTTCCAGTAAACTT 

TCTTTATAGAATGGGAGAAAATATTTGCAAACAATGCATTCAACAATGGCCTGATGTCCA 

GAATTCATGAGGAACTTAAAAAACTCAACAACAAAAATCACCAAATAACATTTAAAAGTG 

GGCAAMGATATGAATAGTCATTTTTCAAAAGAAGACATACCGAATGGCCAACAAGCATA 

TGAAAAMTACTCMCATCTCTAGGCTTTCAGAGGCATGCNAANTAAAACCNCCATTNGA 

TATTATNTTACNNGANCCCNAAATGG G 1 1 1 1 I I I I A AAAGGCCAAA 

Sequence 1097 

CCCCNCGCGTCCGTTNAAGTGTGCTTTGGAAAAGGGAAAAAGTCCTAAGTAGATATAAAA 

CCCTAACTAAGGA AGAA AGCAGGTAGCAGTGGTGGTCCAAGAGACCGTGTAGTGGATGCA 

AGGACCGCTCGTATTTTACACGCTATATTTCAGCAAAGGGTGGCCCATCTGGCAGGAAGA 

TGGGGACATATGTCACATATAGAGCAGTTAAGGAACTAGGGAAAGTGGAAGACTCAGAAG 

ACCTGTCTTTGACCTGGTATGTTCTATCTCTACAGAACCfAATATGGCTTATACATACTG 

CCACAGAAAGGACTGAGGTAGACAGTGGCAAAGACTTCCTAGGAGTTGAACCCCTGAAAT 

TACATAAGGAGTAGGACCCCACCAGAATTCTGTCTTTGTAGGCTGCTGACTGCAGAAGAA 

ACGGTGTAGCGGAGGCAGGGGGAAGAGGAGTCAGNANAGTACACTGGGAAGGAAGAAACG 
GGTTCTTTTCTCTT 

Sequence 1098 

TCGCCCCTGGGCCCTCCTAACCAACCAGGGGAGGGGAGAAGGACCCAATTCTTTTCCTTT 

GGTGCACGTAGCCTGGACCCGTTATGGACAGAGGCCAAAGGAAGATAACAGTGTGGTGTC 

CAGAGATGAAACCAAGTGGTTGATGGGCAGTTCTTTGAGCAACCTTGTTTATGAGCCTAT 

TGATATGCAGATATAGAGGCATCCAATACTATTGACTAATTTAAAATCTTATTCAGTGAG 

TCMCACTCTMATMGCAATGGAGATGGTTCCATTCATTTTTTTGCAAGTATCATTTTT 

ATMACATAAATTTCCTGAGATTTTTGTTTTCATCTTAGCCTCTGTGGAGCTGCTTCGTG 

GTTATGATAAGTGCTGTGTGATGCTCACCTTGGGAGGTCTGCGACATATATTGAAGTCAT 

CTCTAACCTGAAGTACTGACAGACTTTCTGGAAGAAAAGGCTTGTAGGAGGAAACTTCAG 
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AATTCTATTAAATGGTGTAAATGATGAAATTATAGTTGATATATGCTAGAGCATCAGTGC 

TGGGTATTTTAGAAGGGATGGA 

Sequence 1099 

CGGGCCTGTTCCTAGAGCCTCATTGGAGACATTGACAATGCCATGAGGACCTTCCTCAAC 

TACTACACTGTATGGAAGCAGTTTGGGGGGCTCCCGGAATTCTACAACATTCCTCAGGGA 

TACACAGTGGAGAAGCGAGAGGGCTACCCACTTCGGCCAGAACTTATTGAAAGCGCAATG 

TACCTCTACCGTGCCACGGGGGATCCCACCCTCCTAGAACTCGGAAGAGATGCTGTGGAA 

TCCATTGAAAAAATCAGCAAGGTGGAGTGCGGATTTGCAACAAAAGATCGCTTTGGCTGC 

TTTGTGAAGAATAGATTGAAAGGGTCAAAGGTGAGAGCCATCTCACATCCATGCAGGAAC 

CAAGCAGGCAAGATATAAATATGAAAGTAGAAGAAAATAGTCTGGAAGAAAATCCC 

Sequence 1100 

CGCGTCCGGGAGTGCACCCCAAGATCTMCAGCTGTTTCAGAGCTGCTCATTTTAGAGTG 

ATTGGTAGGGAGTTGGTGGCTCAGAGGTCCTAATCAGAATGTGTCCTGGGTTCTGAATGA 

CTAGCAGACTATCATTAACCAAATMATTATGGGATTTTGTCTTAATTATATACATATAC 

ATATACACACACATACACATACACATACATGTGTATATATTCCCTAAAACTTAATAAAGC 

TCAMTAA TAAM TCAGATTTCTTAAGTATTCCAATTCCCTTTAAAATGTAAATCAGATT 

TTATA ATTCTTTTGTTCAAMCTGTCCATTGGCTCCCATTTCACTTAAATCAAAAGCTAG 

ACAATAAGCTAAGATAGCAAACATTATTATCTATTTACTTATGAGTTACTTATGT 

AACTCAAGCATCCAATAACACTGTAGGGTGCTCAATAAAATAGTTGCTGAATGGATAACT 
TTC 

Sequence 1101 

TGTTTTACGACAGAGCTTAGTGCAGCCNGTTCTTGATGGCTGTGCAATGCTTTCCTTTTA 

AGAGTGGAGTTAGCCTCGTCATAAAGCGTGTTTTTGAGTCTGTTCGAACGGGTCAACAAC 

GAAGGGAAGTTTCAGGCAGATCTTGTATGCCTGGCCCTGGTGGCTGCTTTCATTTCCTTC 

CAGTATCAGTGCTAAACAGGAATGAACATGTTCAAGCCCCGTCTCACCCACCTCTGGCAT 

CTTCGCCCTMCTCTGCCCTAGMGACCTTTCCTTCCGTATCGTCAAGAAAACTGAAGTT 

GCTGTTTCACTCCTTCTCCCACCCAGAAACTTCGCTGCATCTTCCTGGATCCCTAGCTCC 

TTGCACCCATGATCCTGTCTCCTTCCTCAGCCCGGCTTCTGGCTGAGCAGCCTGCACTTG 

CTGTCTTCACTCCTACACGCTGCCCCCACTCCTACACGCTGCCCCTGCGTGCTTNTCACT 

TCTCTACCCTTCC 

Sequence 1102 

GTCCGTATCCTAT CTTCA MTTTTTTAAATATGTTCAAAATATCTGGAGGGTGAGAAGTT 

ACCMGTTTGT ATGTT TTGTTTGACTCACCATCTTTATTTTCTGTATATGTAGTAGCTGG 

CMTTGCATATATTTTCTTGATTAACATATTAGAGACTGCTTCCATCATCTTATGTAAAC 

CTGGAAACAAGCTGAAACTAGTCTTTTCTGAAGAACCGTGTATCAGTGTTAGATGTGCAT 

CCCGT TTTGTCA TTCCCTCAGACTTTGAATACAGTCATTACTCTCTGGAAGAGAAATGTA 

AGTATATTTTTTGTTATCTGCAGTATGGTTTAACATGTATTAATAATACACATATGCAGA 

CTCACTAAAGTATCCCCAGTAATTAGTAAATTCCAAAT 

Sequence 1103 

ANTTGGGTACCCCCGGCCNGGCCAGNTGCGCGCGGCGGGGCATGCTGCTCGTCCCCCCGC 

GCCCCCGGCCCGGACACTTGGCGGGTGCCACGAGGACCCGCAGCAGCACGTGCGGTCCCC 

CGGCGTTCCTGGGCGTGTTCGGCCGCCGTGCGCGGACCTNNGCGGGAGTTGGGGCNTGGG 

GGGCGGCNGCCGTTGGTNCGGACAGNCNGGGTGCGCACTTGGGCCCCCNTGNCCATGGCN 

GCAAAGGTGGACCTGAGCACCTCCACCGACTGGAAGGAGGCGAAATCCTTTCTGAAGGGC 

CTGAGTGACAAGCAGCGGGAGGAACATJACTTNTGCAAGGACTTTNTCAGGCTGAANAAG 

ATCCCNACATGGAAGGANATGGCGAAAGGGGTGGCTGT 

Sequence 1104 

TGCCNCGCGTCCGAGCATCTCAGGTMCMTTTGAGCATAACTTTAACCATAACTTATGA 
TAGCATAATAACATTCATTAGTAATTCAGTAGCCGTATGTGCCAGGCTGTGTTAGGTGCT 
TTATATATTGTTTAATTTTTAAAAACTTGTGGAGTGTACAGATTGGTAAGGTGACATTGT 
ATCACAMGCTAGTCTTTGAGTCCAAAGTTTTGTGGTTTTATGTTATGATATACTTTTAT 
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CATGGMTTGTCTTATTAMTGTTTTGCCAGTGGTTCTTAMGTGTGTTTCTGACACCAG 

TAGCATTGACTTCACTTAGAAACCTGTTAGAAATACAAATTATTTGGCCCCACCCAACAC 

TTGAGTCACAAACTTTGCAGATGGGGCTCMTCTGTTTTMCAAGCGCTTCATGTMTTT 

TGATGCAGGCCTAAGTTTTTGAGCCCTGCAGTATGCATTTCTATTTTTAAGCAAAGATCT 

TGGTCTTTCTTTTTGGACATTGTAGAMTAACAT 

NTTTGGTTTGGT 

Sequence 1105 

ACGCGTCCGCTCTGGTCAAGCAGGCGGTACTTCTCCTTGGATGTCTCAGCCACAGTGCCT 

ATCAGGGTACTGAGGGAGAGCACACATGGCCCGAGGCCCTNGGAGCCCTCGGAGGCTGAG 

TCAAMGAGTCTCCCTCGMTTGGTGGGCCTTTAGAAGACTTGGCTTCTTCACTGGAGAG 

CTATAMGTAAACACCACACTGAGGGCCCTCGTCCCAGGMGGCCTTCAGAGCATTTTCA 

TTTCTGAACACGTCCCCTCATCTTTCAAGATTTTCTGGTCCTCTAAAGCTGAGAAACTAC 

AAGCACTGAMTGAGATGAGTTTTGATAAGGATGGTAATGAAGCACAAAAGCGTTATTCA 

CATTACTCACTGACTTTAATATMTTTTGAMTATTTCATACTTTTGAAAAACAAAATAG 

CCTGGGCGACAAGANTGAGACTCCATCTCAAAGGTAAAANAAATTTAANCTGGGTGCCNG 

CCGCTTGACTATGTCTAGAGAAAAAAACTTCCACA 

Sequence 1106 

GACCCANAGAAAAGGNGCCAAAGGGCATGTCMGCAATTGAAGTTAAGCTCATGTTTTTA 

AAGATCCGTTTATTGAGATGATTTTGAMTGCTCCTTTACCATTCATAATTTAAAAATAA 

AGTTTMCMTGGTTTAAATTCANAATGGATTAAAATGGAGTTGGGGGTGGAAAGTAGAG 

CCATTCTTAGTAAATATAAATAACTGAAAAGTTCTTCTGAGGAGACTATGTACCGAAGTT 

ATCATTGCATCTTTCAGTATAGGCAGATCTCTCCCTCATATMCCGGATGTTTCTTGGCG 

CTTGGAATATCAGATAMGGTAAAGTTTAAGAAACTTCTCTAGCGGGGGATTTAGGGAAC 

TTCTTAAAACCTAGAGTTAAMGCTGTTGCGTGTTTGTTGTGTTATTTTAGACCAATCAA 

CTTCATAGGCTAGACTAGTCTAGA 

Sequence 1107 

ACGCGTCCGAAAATTCACAGGGTGTGTTGGCACACGCCTGTAATGCCAGCTACTCAGGTG 

GCTGAGGCATAAGAATTGCTTGAGCCTGGGAGGCAGAGATTGCACTGAGCCGAGATCGCG 

CCACTATACTCCAGCCTGGGCAACANACATCCTGTCTCAAATAAATTAAATTACATTAAA 

TGTTTAAGAAGAAGTCTAAATAAGATTCATATGCTGCCCTCCCTCAGATAATGAGGGAAC 

CTGGGGTACTTCTGGGCTACTCTGGGGGACAAAGTATAACTATTCAAATGGCAAGTTGAA 

TTAGTACAGTCTAGGAGCCTTGGAGATGGCTTCTTGAAAGAGGTAGAACCTGAAATTCTC 

CTTCCTTGAGGGACGGNCAGGATTTGGCCAGATGGAAAGGCAAGTGGAAGGCTTTGCAGG 

GACAAGCAATGTAANCAGANCCTAGAAATGG 

Sequence 1108 

TCGACCCCGCGTCCGGNGTMTTCTAGGGGAMTNATATTTCTGAACAACAATGTTGGTT 

TGTGCAGGAAMTCACCAMGMCATGACTAGAMGTGTATAGCTACAGTTTCCCTCTTT 

TAAATGGGAATAGCAAAACATATAAAGAATATTGATAGGCCGGGTGCGGTGGCTCACGCC 

TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAAGAGGTCAGGAGATCGAGAC 

CATCATGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGC 

GTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGA 

ATCCGGGAGGTGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGA 

CAGAGCGAGAGACTCTGCCTCAAACAAAAAAAAAAGNNAAAANAAAAAA 

Sequence 1109 

CCGTCTCTGGCTTGGCCAGGTTTAAATTAATAAAAATGAAGATGAAAATAAGTTGTCAGA 

TTTAGGATGTATTTTAGMACCCMCTGATMTTTGCCAACTAATTGGATGCAGAGAGTA 

AGAGGGAGACTCMGAACACCTCTAAGATTTTTACCCTGATCAATGGGATAGGTGAAAGT 

ACATTAATGGAGATTGAGAATCCTGGTGGAGGTACAAGTTTAGGGGTACTGAAGAGTGCT 

TTTGGACATGTGAATTCTTAGAAGCCTACTAGATTCTCCAAATGGAGACATAAAACATAA 

TTGAATACAAAAGTCAGGAGTTCAGGAGAGGGCTGAGCTAAAGATACAAATTTGATAGAC 

ATGAGCATTTAAAAAAAACTGCATGAAAATACTAAAGATAGGCTGTCCTGCCTATGGAAT 
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AGCCATTCTTTGATCCCTTTACTTrCTTAATAMCTTGGTTTCACCTTACTCTATGGACT 

T CCCC CAAATTCTTTC7TGTGTGAGGTCCAAAAACTCTCTGTTGGGGTCTAGATCAGACC 

CTTTTCMGTACATCTTNCTGATGAACCACAAANGGATTATACTAAAGAGACCCCCCACC 

Sequence 1110 

CCGGNAAT TTGC TTATTCTAATTGAGACACANTGGTGGGGAGTGGGGGTCGGGGACTACA 

CAGGT GCATT TTCTGAACATTTATAAAATGAAAAAGATGGAGGCTTGGCTAGAATGGTTA 

ATCCCCTTTTCATTCCTCTMTTCTATGACAATTTT1TTAAAAAACCAAACACAACCAAA 

ATAAGAGTGGACAGTTGAGMTTACCTTTAGGTTCCCATGACCCTGAAGACTGTATTTGG 

CCTTGGATCCATTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1111 

CCCACGCGTCCGCGGCCATTTCTGTATCCCCCTGCCTGGGTTTGCTGCCCTTTATGCTCC 

TACCTCACCAGGTACAAGGAACATGAAGATGGCTATATGCGGCTGCAGCTGGTTCGCTAC 

GAGAGTGTAGAGCTGACACAGCAACTGCTGCGGCAACCACAAGAGGGATCGGGCCTGGGA 

ACGTCGCTGAACGAGAGCAGCCTGCAGGGCATTATTCTAGAAACAGTGCCAGGGGAGCCA 

GGACGTAAGGAAGAGGAAGAGGAGGGCAAGGGTAGCGAAGGGACAGCCCTCTCAGCCTCT 

CAGGACAACCCCAGTTCTGTCATCCACGTGGTGAATCAGACCAATGCCCAAGGCCAGCAA 

GAGATTGTCTACTATGTGCTGTCTGAAGCCCCAGGGGAGCCTCCCCCAGCCCCTGAGCCA 

CCTTCAGGGGGCATCATGGAAAAGCTTCAAGGAATAGCTGAGGAGCCAGAGATCCAGATG 

GTTTGAAGGCCGCAGAGCCAGACCATTTCTTCCCAGGTCTGAMGTTTGAGCCAGGCAAG 

TGGCAGTGCCCCTAGTGGGCAGCCGTTGCCAATGGATGCC 

Sequence 11 12 

CCCCCGCGTC CGTM TTTTAAAGA ACCTTG TTATTAGAAAATCTCAGCCTAATACAATCT 

GAAGTTMGAGTTTTAGCAGCATTGTTTTTCTMGTAGATTTAGCTATAGATTT^ 

GGCCAMCMGGMGAGTATATGCCCTTGTAMTGAGTCTTGTTTTGTTATTTAAATAGT 

CAGTCAAAACGTAGAAATCAGTATACGTAAAATAAAATGCATGAGACTATTAAATCTTTT 

CATATACTCTACAAATAAAATGAAATCTGTGTGTGGTCCTGGTTGACTGGGCATCTAAAG 

GGMTC AGA AAAG AGATTGTGAAMGTTATATATATATCCTCTTCCTTATTTTAGTTTTG 

CTTTTTCCTATTTTCCATMTTMGTGCCGTTTACAMGTGGCATCAAAAAATTGAAGCA 

GGCCAGGCATGGTGGCTCATGCCTGTGGTCCCAGCAGTTTGGGAGGCTGAGGGCAGGTGG 

ATCACTTGAGATCGGGGGTTCGTGACCAGCCTGGCCAACATGGTGAAAGCCCATCTCTAC 

TGGGAATATAAAAATTAGCCCGGCGTGGTGGCATGTGCCTGTGGTCGCAGCTACTTGGGA 

GGCTAAGACAGGAGAATTGCTTGGGCCCTGGGAGGGGGGAGTTCANNNGNCCTGGANCGN 

CCCCTGNNCTCNANCCCTGGCAACCANTGNNGACACCNNCTTAAAAAAAAAA 

Sequence 1113 

TCGACCCCGCGTCCGGTTTTTTGTCCCAGCAGTGGCATTTAMTTACTGTACTTTAAGA^ 

ATGGAATTGCTGGAGGCTTGGAAACTTGAGTGCAATTTCCCTAGTACGACCTCCAAGGAG 

AATAGAGCAAMCAGTGGTAGGAAAMCTCTCAMATTTTACCCMTTGTATGTTTTCTA 

CATTGTCAGTATCTAGTTTTATATAGTTMTATGTACTTCTAAMTTTCTGACAGTGNTT 

GGTGTATAAAACAGACCAAGCTCAAGATGTAAAGAAGATTGAGAAATTCCACANTCAACT 

AATGCGACTTATGGTAGCCAAGGAAGCCCGCAATGTTACCATGGAAACTGAGTGAATGGT 

TTGAMTGMGACTTTGTCGTGTACTTAGGMGTAMTATCTTTTGAATTAGAGAAAGTG 

TTGGG ACAG A AAGTA CTTTATGTMCTMGTGGGCTGTTCAGAAGCTTAGAGGTCATTTT 

TTGTAATTTTNTTTTTMTTACTTTAGM 

AAG CCTTTATTT ACTTTTTG G G A AA 

Sequence 1114 

TCGACCCCGCGTCCGATTCCTTCTTCATATATTATGTCAGAAGAGTTTGAGAAGAAATGG 
TATTAATTCTTCTTTMATGTTAGGTTGACTCACCAGTTAATGCAGCTATTTGGTCATM 
ATG TTTC TTTGTTAATCACTTTCGATTACTAATTCAATCTGCTAGGTTATAGGTCTATTC 
AGATTTTCTCTTTCTTCTTG^ 

TTTCATCCAGGACAGCTAATTTGTTGTTAGACAGTTGTTCACAGTATACTCCTGTAATCC 
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TTTTGTATTTC TGTAM GTTGGTAGTMTGGCTCTGCTTTCATTTATTATTTTAATMTT 

AGTCTTCCATCTTTTGCTCAGTCAATATAGTGAAAGGCTTGATCTTTCAAATAATCTATG 

TTTATTCATTCTACTGCTCTCCAAACTTCTATTTTATTGATTTATGCTCTAATTATGCTC 

TCTATTATTTCTTTCATACTGCTAGCmGGATTAGGCTTATTTT 

Sequence 1115 

GCCCCGCGT CCGGG ATGACTAATGAAAGCAATNAGCTTGAACATTTAGAAAAAATTCATA 

TATGATCTAMTTTTTATATTATCATTTCTGTGCCTTCTAATTCCTGCATCCTGTTCAAA 

ACATCTTTCCAGACATTAACTTACACATTGTATAAAACCGACCAAAATGATTTCCTAAAG 

TTCATGCAAAAAAAAAAAAAAACAACCTAATTTTCTGTTAATATAAAAGAAACTTCAGTT 

TACTGACCGTGAAACAGACTATGTACTGACATCCAGGGTAAAGTAAAAGACTTTTAAATA 

TTGGTCATTAMGGACAGGAGCTMGCTAGCAAAGCAAMCATCTTTAGCACTTTGCAGA 

TCTC MGCAGTTMCCAGGCTCTGATTCCCTTCCACTGTTTTATGMTTAATTCCAGTTC 

TmCA TGTATC TTTGMCCTMGATTATGMGTMTTTCCCTATTAGGGACTAGAATGA 

CTTCAGl I I I I iCATTTGATAAAMTCAGMCTGCTACCTTTCCCTTTTTTAATGATGCA 

AAATGTAGATGAGTGCATTMGGGTTGTMGGATCTTTATCATTTTATGNCATTCATTGA 

AAATTGAAATGTTCATTCTTTTTAATGGTT 

Sequence 1116 

CNGCTTTCGTCTCTTCCTTTNAAGTTC^ 

TGGGACCTCCAGAATMTGTTTCATGMGTAGCATGTATCCATATCTGGTTCTTGACTTT 

TTCATCATTATAATTGTTTTCTATGGGTTACTTATCAGTTTMGAATGCTTMTTCCTAG 

ATGAAC TAAGAG TGTTTATTACATGTTGAGATTTATGGTATGCTTTTTCTTCCTCAAGAT 

AATGCATTTTTTGTA TTATCTGT TAATGTGATAGGTTATCCATTTGTGTATTTTCAATCA 

TTGMCAACCC TTGAT TTTTTTGGATAMCTCTATTTGGTCATTATGCATCATTCTATAA 

ACCCTGCTGM TTTTTCATTT GCCAACATCTTATTTCAGATTCTTTTAATCTGTGTCCAA 

CMTGAGATTTGTTTTCTTTTGCMTTTGTTTTGAATTTTTGGTATC 

CCTTATAATGGAAAATACATATTTCTCAAACTTTTACACTGATATATTCATAGTATTTTT 

TTATMTTTGAAAMTCTTGTCAGTATCTGTATTAAGGCCTNCA7TTCAGTTCTGCTATT 

TCATATTGCCTTAAGGTGGCTATTTGGCTCTTTAAGGACCCCGATTTTGATTTTGTCATT 

TTTAAAATAAACCCCATTTATGCTATAAAAAAAAA 

Sequence 1117, 

GCCTTTTATGGTGATGGMTATGTCTCAGGAGGAGAGCTATTTGATTATATCTGTAAGAA 

TGGMGGGTMGCTGTTCTGCTTTMTTCTGTATGTATTTTGTNNCTNGNCCTTTATCCT 

TTACTAGCATCAAMTGTCAGCMCCAATTTTAAGAGGTCTATTTAATAACCAGTTCCCT 

TAGTCATATATTTGTTTAGMTCATAMCTATGTAGAACTAGAAAGGATCTTAAAGATTA 

TCTCCTTAGCCTGTTTATACAGATGTGGATACTGAGCCTCGCGGCTTATATGATTGCTCA 

CAGTAACGTGATTTATTAATGACGGAATTGGCTTGAGCCCCCAGAACTCATAATCCTCAG 

ACTTATGCTTCCAGGGTATACAAATACTTTGAATATGTATCTTAATGTAATTAATCGTAC 

CAAATATATTATTATTACT 

Sequence 1118 

GCGTCCGTTGTCATCTATTTACTTTACATATGTCATAAACCTAACACTACATGGTCATTT 

TTGTTTAAACAGTCAATTACCTTTTAMGGGATTTGAATAATAAGTCAAAATCTAATACA 

TTMCTGTGTAGTTAGCATTTCTGGTGCTCTTCTTTCTTTTCTGTAGATCCATACTTCCA 

TCTGGCATTAT1TTCCTACTGCCAGAAGGACTTCCTTTAACATTTCTTGTAGTGTAGATC 

TGCTGGTGATGMTTCTTTCAGCTTTTGTMTTC^ 

GAGTATAGGTTMTAGCTTTTTCCTTTCAGTACTCTAAAGATGTTGCTCCAGGCCAGGCG 

CGGTGGCTCACTCCTGTAATCCCAGCACTTTGGGAGTCTTGAGGTGGGCAGAACACTTGA 

GGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCCGTGCTTCTAAACATAT 

TAAAGAAAAAAAAGA 

Sequence 1119 

NCGTGACATGCTGGCTGCTAGTNAGCTCCCCCATGATTGTCAGCTTCCGAGCCCTCACTA 
GAAGCAGATACCACCCCACCACCATGTTTCCTTTAAAGCCTGCAGAACTGNGAACCAATT 



# 




WO 01/070979 



PCT/US01/09126 



TABLE 1 
183/467 



MMCTCTTTTCTTTATAAATTATCCAGCCTCAAGTATTTATAGCAACACAATAATGGCC 

TAACACAACCTACMCTCTCTATATGTATTTGTGTGTATTTAAAACATGCAGGAAATAAC 

ACAGAATCCAAGGCACCCAAAACTCATTAAAAAATGGAATCAAGAATTCATATGCCATTA 

TGAMTTAGCCAGTCCTAAMTCTGACCTCTCTGCATTTTCACATTATTCTCCTCTCTCT 

ATCCCTGCCTTCCTCCCTCCCTTCCTCCAACTGTCAGAATTGTCCTGTAATCAAACATGT 

TCACATCACAGCTTn"CATTTTCTATTTCCAATCAATTGACCAGTCTAGCCAAGTAGCAT 

CCTGGATCCCGTATTACATATTCCTAGGACAGGAAGCCAGA7TTTT 

Sequence 1120 

AGCTCTTTAGCAGGAGACAATTCTTAAACTTAAATTAAACTGAAAAAGCCACAGAAAAAA 

GGGTTTGACACCCTAAAGCCAGTGTCCAAATGAACGCTACGGTTGNCCTCATAGGTGTGT 

TATGAATGTTTACCCTGACCTCCTAGAAGAAAAGGAAAAAGAAAGGAAAGAGAGAGGGAG 

AAGGTGAGGGAAGGGGGGAAGAAAAGAAAGGAGAGAAAAGAAGACAGAGAAAAAACAGGG 

AGGGAGGMGCTGGGGMGGAAAMGACCATTTGCTGACTCCG7TGTTTTATTTCCCAGA 

ATGATTCMTACCTCMGMGATTTCACTCCAGMGTGTACAGAGTTTTCCTCAACAACC 

TTTGCC CTCGAC CTGAAATTGATMCATCTTTTCAGAATTGTMGAGTACACATTTTAAG 

CCATATCTTTTTCAGCTTGCATTGATTCTCAGGTGGCTAGAGCAGGACTTGGAGTGGTAA 

TTGGAGATGGAAGAACATCATACACTGTGTCTAAA 

Sequence 1121 

CGGCCGAGGTACTATAATGGTCCCCATCTTMTTTGAAAGCGTTTGAGAATCTTTTAGGA 

CANGCACTGACGAAGGCACTNGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGT 

GGCTACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCAT 

AAGAGAGAAGATTCCTTTGAAAGCTTGGACTCTTTGGGCTCGAGGTCATTGACAAGCTGC 

TCCTCTGATATCACGTTGAGAGGGGGGCGTGAAGGTTTTGAAAGTGACACAGATTCGGAA 

TTTACATTCAAGATGCAGGATTATAATAAAGATGATATGTCCGTATCGAAGGATTTCGGC 

TGTTGAGCCAAAGACTGCGTTACCCTTCAATCGTTTTTTACCCAACAAAAGTAGACAGCC 

ATCCTATGTACCTGCCCG 

Sequence 1122 

CCCTTTCGAGCGGCCNTNCGGGCTTNTACGCGGGGGCAGCGGGAAGCTCGCAGCAGCTGG 

GGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGAT 

GAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCA 

AGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCA 

ACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATC 

ATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTACTA 

ACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTG 

GCTTGGGAATTTTTCTCAAAAATAT 

Sequence 1123 

CCCTTTCGAGCGGCCGTNCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

ACTGCAAAAMin"GCCMAATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGT7TGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1124 

CCCTTATTTTNGGCNTTNGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCA 

MMATTGCCAAAATGCNACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGG 

TTTGGAGTGTGGAAGCCTNATCCCTTCANCATCAAGCTGGAATGGGGAATGAAGAATGGA 
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NATGTGGTGCCCACTAGGCTACTGNTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTA 

TTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTA 

Sequence 1125 

CCCTTANCGTGNTCNCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TNTATAGAGGGTGGGAAAATAAACCANAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCGTCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATTGTTAACACCTGAATTG 

Sequence 1126 

CCCTTTCGAG CGGCC GCCCGGGCAGGTAC I I I I M I I I I I I N ) I I 1 1 I I NNGGACTTGCG 

ACAGTTCCACCTTTTACTCTCATTGGTAAMTCTCCTTTTMTTATTAAAAATATTGATA 

AATTTATTAAA TTAAGTC TTTNATTCTTTTGTAATCAGMGAGGACATTAATGTTGCGTG 

TCTTGACTGTCTTTTTTGCTTTGTAGATTTATTTTG 

TTTCTGNrTTGATTTTTCCAGMGCAGTTACTTTM 

TTrrGCT TTAG GCATTAGTTTCTTCTTCCTTTATAGNTTTTTCAAM 

TTATTTGTTTTGAAAGMGTAAATTTGGGGTAATTTITTC^ 

AGTTAAAAA ATT ACC AACTG ATTG CATTTCTTTACTTAATTTTG C AAATCG ATTTTACTT 

CATCAAAAAAAAGAATTTMNAATTMTTTACCTTGTTCAGGTCTTGGAATTTNCACGCC 

CTCTAAGACGAAGAGCCACTTTTACTCTGCGTATCTMTAAAMTTCTTTGGCTTTTTTT 
GCTT 

Sequence 1127 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGMGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1128 

CCCTT AGCGTGGT CGCGGCCGAGGTACATGCTANAAAACATTANCACAGATACGACAGAG 

TGTGGI Ulllll I AGAAATGGGTAATTTCTCTCTCCAGTATCCTTTCACTTGTATGAGA 

TATTTCTCCTCTCCTGTTTTCACAAACCAAGAAATCCCCAGGTAGGCCAATCCCAGAGGT 

GCCATTTAGCAGTATGCAGCAGCCCAGTTTCAGCATAACAAAACATGCCTTGGTAGTGGC 

TCTCTCATGCAMTAAMGMAGCTTAAGAAATTCTTGTTGTAGGTGGATTAGGCAAGGC 

TGCCATTCAGCTGGTATAAGCTAAAAGTAAAAAATCAAAACGCTCAAGAAAACGGACACA 

ATTTTGGMTGATTAAAGATGTCTTTATAMGTTTTTTTCMGACTTCATTCTA 

CAGAATAAMAATGGGTGTCAGCTCACTTGTAAGACACCMCCAGATTTTCCTTATACTG 

TCTCAAAATTTAAAGATCMTTTCCCCAGAAGGTGTNCAATGCATCATAAAATGGCCCTT 

TTTTGAGGATGGGAGAGGAAGGGTTGGGCAGGATGGAATATTAAATTGTACATGGATAAA 

CATGCCMGACTGTTATCCMTCTAGATMTTrATATACATTTTGATGACTTAAGGAAAA 

CAMGCMTCATTTGGTGACAGCCTAAAAAGCNTGACCNTATTTAACATACTTAGGAACT 

TTTTTNGG 

Sequence 1129 

CGTTCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAA 

AAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCA 

AMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTC 

AGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTG 

GAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCC 
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CACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATG 

TAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTA 

TGTTAACACCTGAATTGCTGGGTCTTGAAGAGAGCCCAAGGAGTTCTGGGAAGAGGGACC 

AGATTGGGGGGTAGGGTCACGGGCTTGGGTGATASAATTATTTCTCGAATGACTTTCTTG 

AGTGCCMTTTGMCTGTAACATTTGCTTANTCACCTTTAGNGGAGTAATCTCCTGGGCT 

TGGTTCTATATTTATATAAAAG 

Sequence 1130 

CGGGCAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTA 

CCAACTAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGMGCTCCTTTCTATT 

ACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGGTTCCAATAAGTTATCCC 

CAGTCTTAAANGTAGGTTAAGTACCTCNGGCCGGCCACCGGGNTGGAGCTCCAAATTNGC 

CCTATMGTGAGGTCGGATTTACGCCCCGCCTCACTTGGCCCGNNGTTTTACAACCGTCC 

GTNGA CTNGG GAAMCCCCTGGGC^TITACCCCMNCTTTMTCCGCCTTGGCAGCACAA 

TCCCCCTTTTCGNCCAGGTTGGCGTNAATAANCGAAAAAGGCCCGGAACCGAATCGGCCC 

NTTTCCNAACANGTTNGCGCCAGTNCTGGAATNGGCNAAATGGGGACCCCNCCCCTTGTT 

MCCGGGGNGCATTTAAACCNCCGGCCGGNTGNTGGTTGGNTTACCCCCCAANNGGTGAC 

CCGNNTTCMCTTTGGCAAGGGCCCCTMNGGCCGfST^^ 

Sequence 1131 

CGANGTACGCGGGGCTTTCCAAACAGCATAAGAAGTGTTTGAGCCNTNAGTATACTGAAG 

GAAGGGCTCCCTCGAGTTCTGGTGTGAAGAGATAAATCACCAGAAAAATGATGGCATCAA 

AACAAAAAAAGAACTCATTGTGAATAAGAAAAAACATCTAGGCCCAGTCGAAGAATATCA 

GCTGCTGCTTCAGGTGACCTATAGAGATTCCAAGGAGAAMGAGATTTGAGAAATTTTCT 

GAAGCTCTTGAAGCCTCCATTATTATGGTCACATGGGCTAATTAGAATTATCAAGAGCAA 

AGGCCACCACAGACTGCAACANGCCTGAATGGAGTCCTGCAGTGTACCTGCCCGGGCGGC 

CGCTCTTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAT 

ACCCGGCNGACCTNGAGGGGGGGGCCCCGGTACCCAACCTTTTGT 

Sequence 1 1 32 

AGCTCACTCAAAGGGCGGTAATACGGGTTATCCACAAGAATCAAGGGGATAACCGCAGGA 

AAGAAACATGTGAGCAAAAGGGCCAAGCAAAAGGGCCAGGAACCGTAAAAAGGGCCCGCG 

TTGCTGGCNGTTTTTCCATAGGGCTCCCGCCCCCCTGGACCGAAGCATCACAAAAAATCG 

ACGCTCAAAGTTCAGAGGTGGGCGAAAACCCCGGACAAGG 

Sequence 1133 

CCCGGGA ACAAAA GCNGCAACCGNGCCCCCCCTCCAGGTCNACGGNNTCGANAAGCNCGA 

AAACCGMTTTTTGNAGNTTTNGGGGACCCACTANTTNGNGAGCGGGGGCCGANNNAGNG 

CGGC 

Sequence 1134 

CCCCCCCNCGAGGNCGACGGNANCGANAAGCAGATNNTNAATTTTTGCAGCCCGGGGGAT 

CCNGGAGNGNGAGAGCCCACCGNGGGGGNGCGCCAACACCCCCGANAGAGAGNCGAANNA 

CGCGCGCNCACNGGCCGNCGANAAACAACGGCGAGACNGNGAAAACCCGGGCGANACCCA 

ACANAAGCGCCAGGCAGCACANACCCCAANCGCCAGCNGGCGAAACANNnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGANACCGCGCAGCGNGACCGNAACACNAGC 

CAGCCGCCCNAGCGCCCGCNCCAAGCGCNNANCNNCCCANACCAACANCGCCACGNACGG 

CCGGGCAANCCCCGGCAAGCACAAAAAGCGGGGGGGCNCCCTGGAAGGGGNNCNCGAANA 

AAAGGCAAGACGGGNACCNCNGAANCCCCAAAAAACC 

Sequence 1135 

GGGAGTNNGGGAGCGGCCGAGGGCCAAANGANCCGCGAAGACAAGGCCATCCACCACTNN 

ANAGAGGGGGNAAAAACAAACCAGAAACCAAGGGAGAAAGAAAAGANGAAAGACAAACNG 

CAAAAAANNGCCAAAACGCGACNANCAAAAAANGGAGCAGANGCNGAGGCANNGCAAGNC 

ACGGCANACCGNCAGGAGCCGAANGAAAAAAAGCAACAAGCAAGANGAAGACNCTGAGAG 

GGGCNCGGAGGCCNGGAAACCNCATCCCTNCAACAACAAAGCNGGAANGGGGGAAANGAA 
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GAANCAGAAGAAGGGGGCGCCCACNAGGCTACNGCNGAAAGGGAACCGAAAAACCCCNCA 

CCAANNAGGNATNCAAAAAANNGGAAACGGACCGGGNCANGGCAAAAAAAANGGAACCAA 

GACACCGGGCCCANACCACNGGACCANGGGGNAANGGAAACACCCCGGAAATGGCAGGGN 

CCCCGAANAGAACCCCAAGGGAGCCCAGGGAAGAGGGACCCAAGATGGGGGGGGAAAGGC 

CCACCGGGCCNGGGGNGAAANAACAAAGCCACCGAGGGCCAACCGNAAGNGGCCAANNGA 

ACCCGGAACNANAAGGCNNAANCCACCCTGAAGNGGGAGGAAAACCCCCAGGGCNGGGGG 

CCAAANANNAAAAAAAAGCNGCCCAAAACCCCCAA 

Sequence 1136 

hflTTMTTTTTTGCAGCCCGGGGG 

CAANCGCCCNACA.GCGAGNCGNAANACGCGCNGCCCACNGGCCCGCCGNANAAACAACGN 
CGAGACGGGGAAAACCCCGGCGNCACCCAACNGAAACNGCCANGCAGCACAANCCNCAAN 

CGCCAGCGGGCGGAANAGCGAAGAGGCCNCGCACCGAACGCCCANCCCAACAGNGGCGCA 

ACAGAAAGGGCGAAA 

Sequence 1137 

TTTGCANTTTNGGGGACCCACNAANTGGNGAGCGGGCGANGAACCACACNNACGGCCCGA 

GNCCNNGNCNACNGGNCNAGAGNGANAANCACAGGGACACCGANCANCCCTNNCGGCANG 

GACCCACAGCAGCAGGNGGANNGCCATACCGAGANGAAGGAAGACNCAGACANCGCCAAC 

CATNCCCGAGNGGACCNGCCCGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACGNNAAACAACCGGCGAGACNGNGAAAA 

CCCNGGCGNNACCCAACNNAANCGCCGNGCAGCACAACCCCCNACGCCAGCNGGCGNAAA 

AAGCGAAGAGGCCCCGCACCGAGCGCCCNTCCCAACAGGGGCGCAGCCCGAANGGCGAAA 

NGGGGACGCGCCCTGGAAGCGGCGCAANAAAGCGCCGGCGGGAGGGGGGGGANACNCCCA 

NNCGGGACCGCNACAACNANGCCAAGCGCCCNAANCGCCCGGCACCCANACGCTANCNAC 

CCNNCCNANCCACGCCCACGGGANCGCCGGCNANCCCCGGCAAGCCNNAAAAAACGGGGG 

GCNCCCCNNAAGGGGNCCCGACANANGAGGCCNNACCGGCACCCCGACCCCCANAAAACC 

AGGAAAAAGGGNGGAAGGGNCNACGGAAGGGGGCCC 

Sequence 1138 

CACGCGTCCGGAAAGGAGGGGAGGGGTGGGGCGGGCCGGATCTGCTCGCAGGCCGCACCC 

GCCTCCGGCTGGATGCTCAGAGTCTCGCCCTGATGGCCAGACTGGAGTGCAGNGGCGTGA 

TCTCGGCTCACTGCATCCTCCACCTCCTAGGTTCAAGTGATTCTTCCACCTCAGCCTCCC 

CTCCCAAGGAGCTGGCATCACAGGACAGGCACGGAGACTCACGCCTGAAATCCTAATACT 

TTGGGAGGCCGCGGCAGGAGGATCACCTGAACCCAGGAGTTTGAGACCAGCCTGCCAACA 

TGATGTTGTTATTCATGAGGACCAATGGGTTGGCGAGACAGTACTACAATCAACATTTAG 

CAGTCAGTTATTAAATCTTGGGAGTTATTCATCTATTCAGCCTGAAGAATATTCCAGTGT 

AGTTAGTGATGTTGTACTTCAAGACTTACTGGCATATGTGTNCTCAAAACATTCC 

Sequence 1139 

TTACCCCGCGTCCGGGTAAACAAAACAAAGATCGTTTGTTCTGGAAACAGGTAAAATGGT 

MTCAMTAGATTGTGTTCCAGGAGTGCAAAGGTGGCTTAATATTCACAAATCAGTTGCT 

ATTGTACACCACCTGTAGAAAAGTAATCTGGCATGCAGAACATTCTTATGGTAAAGTTAA 

TGTTCATTTATGATCTTAGCAMTGATGGATTGAAAGGGACTTCCTTAATTGCATAAACA 

GACTTCAACAAACAGTATGATGAAATAGTGAACATTTCTCCTAAGATTATAAAAATAAGA 

CMGGATATCTGCTGTCAATGATTTTATTCAGCATTGTTCAGAAGGACCTAACCAGAAAA 

CTAATGCAAGAMCAGAAACAAAAGGCATAMGATTANMAAGMGTAAMTTTTAAAAA 

AGAAAMGAATATAAATCTCTATTTGCATATGCCATGAGTAMTTTGGTMGTTCCCTGC 

ATAAMGTTATGCAAAMGCATTTTTTTGATATACCAGCCAAAAATCAAGGGAAATGGAA 

AA 

Sequence 1140 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 
AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 
AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 
CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 
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AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTC 

TTGAGAGAGCCCAAGGAGTTCTGGGGAGAGGGACCAGATTGGGGG 

Sequence 1141 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1142 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAG 

Sequence 1143 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGTGGATA 

GAGAAACTGAAGCTCTCTAAAGACCCTGCAGCTGGGAGGTGGCAGAGTCAATGGCAGCCC 

TCAGCCCTATCTGCCCTGACATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAA 

TAACMTGTGTGGGGTCCTTGGCTGAGAGACTTNCCTTTTGGGAATCAATCTGAATGTAT 

GATGACAAAGAMACAACTT1TGCTTTATACAACCTTCTGGTTAGATTCAGGCACCAAGC 

AGGACACTTCTTTGTGGCGCTCCAAGAATCTTTCAAATTCTTCATCACCAATAACAAATC 

TTTCTGCTTCTCTTAGAGCATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGA 

ACACTTTCCAGCGGACAGGGTTTAGTGCTTTGATCTGTTCCGTCATGTCCTCTTCCACGT 

TG AAACGATTAATG ACAGAA I I I I I I I I GGAGGCGACTCTATTAATCCCTACACCACCTN 

CTCAGCTTTTG AAGG GTTTN CACATG GGTTCTTTT 

Sequence 1144 

GNAGCTCCCCGCGGTGGCGGCCGAGGTACGCCACCATGCCTGGCTMTTTTTGTATTTTT 

AGTGGAGACGGGGTTTCACCATGTTGGCCAGGCTGTTCTCGAACTCCTGACCTCGTGATC 

CACCCACATTGTCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCATGAGGAT 

TAGTAMGTGCACTCATGGTMGTAAAAAMTTTGTTTTATGTTNATGCTGATTATATGA 

AG GTCAT CATAGCTTAGACACMTCAAAACCCATGGGGMCATCTTTAGMTTCATTTTT 

CTCTTTTCTTACAAAAAAAGTAAATAGGTAAAATGGAAAATAGAAGACAACCTATCCTAT 

CCTGGAT GAGACACACACATATTTAMTTGAATTATAGACTTAMATTTAAGTAGGGANT 

I I I 1 1 I I l NTGNGNMCAMAGTTTNCAAAAAACCCAAAACTTTTNAGAATCACCCAGTT 

NTTGGAAAATATGATTATGAMGCAGACTTTTTGGATGGGNGCTTMTGACATTTAGGCG 

A CATT TAAAATGCCCCTAGGNGNGGGAACACTTGAAATTGCCANCTAAATTAAAATGACC 

CTTTTMTTTGCCTGGACMCAMAAANTTTCCATGATT^ 

GANNAAGAAA AI H I I I I 1 1 I AG 

Sequence 1145 

CNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCANGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 
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ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 
Sequence 1146 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCACGGTACTGTGTCAACAT 

GCAAGGCCAGGTGGCCGTGAGGAATGTCCCAATGACAGGCTCTATCAGTCATGTCCTGGT 

GCAGCTCCC ACATGT CTTCCAGAGGATGGAAGCTGAAAACCTAGCTTCAGTGATTGATGC 

CAGGTTTMCTTTTTTGTGMCAAGATTTTGCCACAGTATCGTGATGCAGTCATGTCTCA 

CACGCTCATCTATATCCCCTCCTACTTTGACTTCGTGCGTCTTCGAAATTACTTCAAGAA 

GGAGGAATTGAATTTTACCCACATCTGCGAGTACCT 

Sequence 1147 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTNTTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGGGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGTAGGTCC 

Sequence 1148 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTTCT 

GCGCAGCTTCCCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGA 

AAGATGAGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGAACGCATTT 

ACTGTCACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAA 

TGCAAATTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATG 

GAGGATMGAACATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGT 

AGCTACAGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTT 

CAGATCACAAGATGTGAAATTGCAGGATGCAGGGGGTGTACCTTGGCCCGCTCTAGAACT 

AGTG 

Sequence 1149 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACGAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGGGGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGG 

Sequence 1150 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 
TGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 
NAAAGAAMGATGMNGACAAACTGCAAAAAATTGCCAAAATGCNACTTTCTAAAAATGG 
AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATTGAAAAAA 
Sequence 1151 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 

TGAGGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 

GAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCTAAAATGCGACTTTCTAAAAATGG 

AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCA 

ACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATC 

AAGCTGGAATGGGGAACGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 
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Sequence 1152 

CCGCGGTGGCGGCCGCAGTTACCTGACGTATGACAACCCAGATATCTTGAAGAGGGTGTC 

AGGATCAAGATCMATGTGTTCATGAGGTTTAAMCAACTGCCMGGATGGCCTTTTGCT 

GTGGAGGGGAGACAGCCCCATGAGACCCAACAGCGACTTCATTTCCTTGGGCCTTCGGGA 

TGGAGCCCTCGTGTTCAGCTATAACCTGGGCAGTGGTGTGGCATCCATCATGGTGAATGG 

CTCCTTCAACGATGGTCGGTGGCACCGAGTTAAGGCCGTTAGGGATGGCCAGTCAGGAAA 

GATAACCGTGGATGACTACGGAGCCAGAACAGGCAAATCCCCAGGCATGATGCGGCAGCT 

TAACATCAATGGAGCTCTGTATGTGGGTGGAAT 

Sequence 1153 

GCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGAACTAMTTTTTTAMCTTTA 

TTTGCTGTTAMTTCTGTGMGTTTCAGTTATCTAAAATAAATATACACAAATATGAAAT 

ATMTGTTTCAGATTGCAAGGTAATATGTAATAGTAGTGTTTGTAAGATACTCTTGTCTA 

ATATTMCTAGTAGTATTTTGATTTGTACAATGTCACCCTCCCAGCAACAAGAAGAACAA 

GCTACTGMTCAGTGTCCCTTTATTACTATGGCATCAAAGATTTGGCTACTGTTTTCTTC 

TACATGCTAGTGGCGATAATTATTCATGCCGTAATTCAAGAGTATATGTTGGATAAAATT 

MCAGGCGAATGCACTTCTCCAAAACAMACACAGCAAGTTTAATGAATCTGGTCAGCTT 

AGTGCGTTCTACCTTTTTGCCTGTGTTTGGGGCACA 

Sequence 1154 

GAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGT 

Sequence 1155 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAATGTGTGAAGACAAGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAMCTGCMAAAATTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTT 

GCATGTCTTGGCATTTCCTTCAGGAGCTGAATGAAAAAATGCANCAAGCANATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCAC 

CAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGG 

CCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGG 

Sequence 1156 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTC 

ACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGA 

GGCT GGCTCAGGTTTGACTCTATCTTGGAAAAAAATAGGAAAACTTCATTTATGGAATAG 

TTTTGMTAACCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACG 

GGTAGAATGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTC 

TGG AATTG GTCTGACCCMTTAACACACGGCCTCTGATGGGAATAGATGTATTTTGGGGA 

CACATTTTAATCTGATAGCTGTMCCCCTTTTGAGTTGGCTTTTGTTCACTGGAATCCCT 

TTCCAGTCAATGAATTTCCGAGAAAAATTCAGAGGAAGAGCTGTCGGAGGCACCAGAGTG 

CTGATGTTTTCT 

Sequence 1157 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 
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TCATTTCGTGAAGGAGCAAGGCTGMGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGMGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCMTACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATG 

TTTMCACCATCTTACTAACCATTATTGCTATGGGTGGGTATACACTGCCTATGTCTTTA 

TTCCMTCCACATTCGCCTGGGCTTGGGMTTTTTCTTCAAAAATATGTGGATATCACAG 

GNCCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCTTGCAGGGNAT 

Sequence 1158 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGA 

AGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATA 

CACCCAACTGGGAGATGAGGAAACAGCAACX)CAGAGAGGAGAACTAACCCACACAGGATC 

ATTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTG 

CAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTA 

CTATCMTACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTATGT 

TTAACACCATCTTACTAACCATTATTGCT 

Sequence 1 1 59 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGAA 

GCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATAC 

ACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCA 

TTTCGTGMGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGC 

AGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTAC 

TATCMTACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTT 

MCACCATCTTACTAACCATTATTGCTATGGGTGGTATACACTGCCTATGTCTTTATTCC 

AATCCACATTCGCCTGGCTTGGGAATTTTTCTCA 

Sequence 1160 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCACAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAMATGGAGCANATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTNAGCATCAAGCTGGAATGGGGAATGAAGAATAGAN 

ATGTGGTGCCCA 

Sequence 1161 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGAAGGTGGGTGACGTGCGGA 

TCTTCTTCTTTTTGTGGCTGTGGACACCTTTCMCACTGCCTTCTTGGCCTTTAAAGCCT 

TCGCTTTGGCTTCAGCTTTAGGAGGGGCAGGAGCCCATCGCAAAACCACGCTGCGGAGAG 

AGGGGCGGGTAATGTAGCCCGGTTGAACATGAACCAGAAGGAAAATGGTTAAAGCTGAGG 

GCACTAATTCCTTACAGGCCCGGGGACATGGAGCTCCAACCAGTGGATGCATGTAGCTTC 

CCAGAACCGAATGTCTGCCCCGCGTACCT 

Sequence 1 1 62 

CCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGNGGATAGAGAAACTGAAGCTCTCTA 

AAGACCCTGCANCTGGGAGGTGGCAGAGTCAATGGCAGCCCTCAGCCCTATCTGCCCTGA 

CATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAATAACAATGTGTGGGGTCCT 

TGGCTGAGAGACTTCCCTTTTGGGAATCAATCTGAATGTATGATGACAAAGAAAACAACT 

TTTGCTTTATACMCCTTNTGGTTAGATTCAGGCACCMGCAGGACACTTCTTTGTGGCG 

CTCCAAGAATCTTTCMATTCTTCATCACCAATAACAAATCTTTCTGCTTCTCTTAGAGC 

ATCTTCTCCACMTTCTCACCCTCAATTAAGAGGCACTGGAACACTTTCCAGCGGACAGG 

GTTTAGT 

Sequence 1163 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 
CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 
AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 
TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 
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AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGAC 

TAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCT 

Sequence 1164 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNGGCACGGTNCCAAATGAAC 

GTGTGAAGACAAGGCCATNCACCACTTTATAGAGGGTGTAAAAATAAACCAGGAAATCAA 

GGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAA 

ATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAA 

TGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAG 

CATCAANCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCTGAAATTCCTCCACCAAGTT 

Sequence 1165 

TTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTTTCTTTNTCCTGT 

CCATAAATCTTCTTCCACCACGTGGCTGTGTNCAAGACTCTCTGAACCTNCTCTGGCTCA 

GGAGGCTTNTAGATNTGTGAATTGTCTGCTCAGTNNAACTCCATTAAATTNAATNTGGCC 

AAGMNTTTCTTCTTMCAGNGGTATTGATGACCATTAATCCTTCAACCTAAACCTGCTC 

ATTAA 

Sequence 1166 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTT 

Sequence 1167 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAGCGTTTGT 

MTGTTTCCCAAATATTAGCTTTGAAATCCAAATGTCAAGCAGGTTAAAGTTCANAAAAC 

TATAMTGTMTTGTTAATAAMCMTGGATGAAAAAAGTCATTGAMTTTTTTTCTACT 

TGGATTAAGMCATAAATTAAAAGTGCAACTGCAAAAATAATATTAGTTTGATAAGTAAA 

ATAMACAMCTAGATATATT1TGAAAATAAAAMCAAATGMACAMATAAMTTTAGG 

TAAAGAAAATTCAACGTMTTTGTTGTAGCTATATTTTTTGTMTMTTACAAAAGTAGA 

MT MTA GCTCATAMGCAAAMCMAATTTATTCTATGTTCTTTTTTTCAGTCMTTCA 

GAATTTT AGCTT CATATTTGAAGCATTTTTTTCTAATTTTGT^ 

TTGCGGATTTTGA7TTGCCATAAAMTATTATATTNTATTTMTTATATTAATCTTCGTC 

AGCTTTAATTGCTCTTCTTTAAAATTTGATCTGAAAT 

Sequence 1168 

CCGCGGTGGCGGCCGAGGTACCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTCCAGGA 
TCAGAAGAGAAGCCAACGTGACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCT 
GTTCACGMGTGGTTGMAMCMTTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCAT 
CCAGGTGTTCACAAAAAATCAGAGAATCTTTTTNAGGGGGTGGNCGCNTTMCCCTTTTN 
NGGTTNANTGAAAATCCCCCCCCTGN I HUM I GAGAGGTTNAAAAA I i 1 1 H II IN AA 
AAAAAAA ACCCCCCCCCCNGGGNATTNTNAN I I I I I I I I I I NNNNNCCCCNNCAAAAATT 
I I I I I I I AACCCCCCCCCNNAAAAAAAAAAGGG I I ) I I I I I I I I I \ AAAAAACCCNCNGN 
TCNCCCNNNC CANAAAAAAAAAAANGGGGGTCCCCN I I I I I I I I I I NTTNNCCCCNGGGN 
I I I I I I II I I I I I I I NTTCTNNNNNNGGGGGGGGGGAAAAAAAAANAA Al i I I I J I I I I J 
TGGNGGGGGGGGNNGCNCCCNCCCCCCNCGGGGGGGGNGGNTTAAAAAAAAANGNCCCCC 
CCNGGGGGGGAAANNANTTTTANTNNNNTCCCCCCCCCCCNGGGGGGGGGG 
Sequence 1169 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 
CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 
ACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 
TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCMCAAGCAGATGAAGACTCTGA 
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GAGGGGTTTGGAGTCTGGAAGCCTCATCNCTTCAGCATCAACTGGAATGG 
Sequence 1170 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGA 

GAGGGGTTTG 

Sequence 1171 

GNGGCGGCCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGG 

GTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAA 

TTGCCAAAATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCAT 

TCCTTCAGGAGCTGAATGAAAAAATGCAACA 

Sequence 1172 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTAGGTCAG 

TTTAAGAAGAGTCAGCTCAGAGAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATC 

AGGGAACAAAATCCTCTCCTTGTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTA 

TCTTATTGCCTAAAAAAMTTTCTTATCATTGTTTCAAAAAAGCAAAATCATGGAAAATT 

TTTGTTGTCCAGGCAAATAAAAGGTCATTTTM7TTAGCTGCMTTTCAGTGTTCCTCAC 

TAGG TGGC ATTTAAATGTCGCCTGATGTCATTAAGCACCATCCAAAAAGTCTGCTTCATA 

ATCTATTTTCAAGACTTGGTGATTCTGAAAGTTTTGG I J I I I GTGACTTTGTTTCTCAGG 

AAAAAAAATATTCCTACTTAAATTTTMGTCTATAATTCAATTTAAATATGTGTGTGTCT 

CATCCAGGATAGGATAGGTTGTCTTCTATTTTCCATTTTACCTATTTAC 1 1 1 I I I I GTAA 

GAAAAGGAGAAAAATGAATTTCTAAAGATGGTCCCCATG 

Sequence 1173 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCTGCACACTGTGTTTATGACTTGTAC 

CTGCCCG 

Sequence 1174 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCNTGCACACTGTGTTTATGACTTGTA 

CCTGCCCG 

Sequence 1175 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTNNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 
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GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGNTGCACACTGTGTTTATGACTTGTAC 

CTGCCCG 

Sequence 11 76 

CCGGGCAGGTACAACAAGCGTTTGTMTGTTTCCCAAANATTAGCTTTGAAATCCAAATG 

TCAAGCAATTAAAGTTCAAAAAACTATAAATGTAATTGTTAATAAAACAATGGATGAAAA 

AAGTCATTGAAATTTTTTTCTACTTGGATTMGMCATAAATTAAAAGTGCAACTGCAAA 

MTMTATTAGTTTGATAAGTAAMTAAAACAAACTAGATATATTTTGAAAATAAAAAAC 

MATGAAACAAAATAAAATTTAGGTAAAGAMATTCMCGTMTTTGTTGTAGCTATATT 

TTTTGTAATAATTACAAAAGTAGAMTAATAGCTCATAAAGCAAAMCAAMTTTATTCT 

ATGT TCTTTTTTTCAGTCAATTCAGAATTTTAGCTTCATATTTGAAGCAI I I 1 1 I ICTAA 

TTTTGTTTGTGAATTTGAATTTGTTTGCGGATTTNGATTTGCCATA 

TATTTAATTATATTAATCTTCGTCAGCTTTMTTGCTCTTTCTTTAACAATTTGATCTG^ 

AATTTGTTTTGGTGTTATTTCATAGTGATCAMTTGCATTTGATAAGTTCCACGACCT 

TGTCATAGACCTTAATTGTGTTGAGTATCCAAACATTTC 

Sequence 1177 

TAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTACGGAAATCCT 

AACTACCACTGGCAGGAAACTGCATATNTTCTGGTTTACATGAAGANGGAGGGCTAANGG 

AAATGCCCAAAACCTTCAGAGATTGACACCGCTGTCATTNTCCATNTCNGTTCCTGGAAT 

CTACCGGGAGTNTTNATMGMGANTTTTGCAAATNGAGGGMGMGCMTTGTTTTCAA 

ACTATATAACTGGAGNCCTTMTTTATAATTAGGGGATATTTAATCAAAAATATNGTAAA 

CCATGGAGGGCCCCCTCAGNGTNCTGGATCAGGTTCMGAAATNGAAATGNTTTTTCACCC 

AAGNCANGACCCCGGCCATGTGGGCATGNTCCGGGTNCCTGGGGGTGGCNTCGNCTGGCT 

TGTGGCGMNGAACAATTMGCCCCTTTTMGTTTATTGMGCCCTGNGGGGAMCTTTA 

AGGGGGTTTCCANAGTTGGGGGANGAAGCANTNGGNNAGTTGGNGAAGGGCATTTTGGGG 

GGG 

Sequence 1178 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GMAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGAC 

ACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

Sequence 1179 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAMTGCGACTTTfsrTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGG 

Sequence 1180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATTGCCAGGAAGATAGGCAGCTCATCTGTG 

TCCTGTGTCCAGTCATTGGGGCTCACCAGGGCNNCCAACTCTCCACCCTAGACGAAGCCT 

TTGAAGAATTAAGAAGCAAAGACTCAGGTGGACTGAAGGCCGCTATGATCGAATTGGTGG 

AAAGGTTGAAGTTCAAGAGCTCAGACCCTAAAGTAACTCGGGACCAAATGAAGATGTTTA 

TACAGCAGGAATTTAAGAAAGTTCAGAAAGTGATTGCTGATGAGGAGCAGAAGGCCCTTC 

ATCTAGTGGACATCCAAGAGGCAATGACCACAGCTCATGTGACTGAGATACTGGCAGACA 

TCCAATCCCACATGGATAGGTTGATGACTCAAATGGCCCAAGCCAAG 
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Sequence 1181 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 

CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 

TGAMGACAAACTGCAAAAAATTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGA 

GCCCAAGGAGTTCTGGGAGAGGGACCCAGATTGGGGGGGTA 

Sequence 1182 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAA 

GACMGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAAiCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCT7TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 

Sequence 1183 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

GAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

ACTAAGACACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTAGGTCCCGGGCTTGG 

TGATAGAMTATTTCTCGATGACTTTCTTGAGTGCAATTTGNACTGTAACATTTGCTTAA 

TCACCTT 

Sequence 1184 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTAT 

TATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACNGCCATAATGCC 

GTGCACAGNTTGAAAAGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTTN 

CTCAAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAA 

TAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCT 

NTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTNTACTTT 

GCAGAACCTCACCMC7TGCCCAGGTNTTCTCCCCGGTCTTGAAGAAATGGCTCTCCACC 

TGAAMGTNNGATCTTCTCCATACCAGCTTCTTMGCAAAAGCAATCCTCTCTTTGCTTC 

CTCAAGGGGCA 

Sequence 1185 

TAGGGCGANTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAA 

GGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAA 

GATGMAGACAMCTGCAMAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTC 

TGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGA 

TGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAA 

TGGGGAATGAANAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

TCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATG 

Sequence 1186 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
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CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 

TGAMGACAAACTGCAMAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAG 

CCCAAGGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTA 

Sequence 1187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAMATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1188 

CCGCGGTGGCGGCCGAGGTACAAGATANTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTT 

GAAMGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTTCNAAGTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGNCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGT 

Sequence 1189 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGGAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAA 

AGAATTGGACTAAAACACTGGCCATACCCACTGGACAG • 

Sequence 1 1 90 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGCGAAGCTAGA 

GACTGTMCCTGAAGATTGGGACMATTAAAGAAAAAAATGTGATTTAACACAATTACAA 

AAACTGTTACGTTAGGGTCAMCMGMCCATTTTATGAAACTGAATTACAACAAATGAC 

ATTATATCTAACTCTTCCGGGTCTCCACAACACTTATACTTACTTAAGCAGCTTAAACAC 

TTCCGAGTCTCCACAGCACTCTGATACTTACTTAAACAGCTCTTTTAACCTGCCCTAGTA 

TTCTTAAGTGCAGCATATCTAA I I I I I I I I 1 1 CCTCAAGTAGTTTGAA 

Sequence 1191 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAATGAACGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTMTGACTGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 
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Sequence 1192 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCNTCCACCACTTTATAGAGGGTGNAAAAATAAACCAGAAATNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCC 

GATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGA 

Sequence 1193 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAMGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCT7TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGGAAAAATGCMCAAGCAGAT 

GMGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTGATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAG 

ACACTGGCCATACCACTGGACAG 

Sequence 1194 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACATAAATCACCTGGAACCTTG 

TTAAAATGCAGATCCTGACTCAGGAGGTCTGAGTTAGAGCCCAGGATTNCATATTTCTAG 

CCAGCTCCATGATGAGCTGCTGGTCCGCAGATCATGCTTGCNGGTTTTGACCAGAGTCAG 

TGTTGGTTANAGTAAGAGGATGAGGCANACATNTGGGAAAAGTCCAGCTGGGGCAAGCAT 

TTGAAGTCTGCCTTCCTACCANGTCAAAATCAAGGCAACGACCTTCCATAGATAACTATC 

AAAGCTTGAGGGGGNGCCTTGAACCCAACTCCTAAATCCCTAAGACCTGCCCACCTCTTG 

TGTCTCCTGTNTNAGCAAACATTCCCACACTCTTGCATATTGTTAAAAGTAACCTCTGCT 

TACCAGGCTTTTG 

Sequence 1195 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGGGGTGTCCGAACAAGGCAGGTTG 

GTGGGTTAAGGTCTTAATCTTGACTCGAGATCTCTCTCCGGAGTTCACAGNGTNGGCGAC 

GAAGCCGAAGCAGCTGGAGCGCGACCCGGAGGAGTCTGACTTCTCGTTGTCTTCATAATT 

TTCATTCGTTGCTTTCTTCGTGGACTTGCGGCTGGGGGAGGATCCCCGCTGGTCGCCGAG 

CAGGCGGGCGGGTAAAGGTAGGCCGCCGAGAGCGAGGTTAGGAGAGGAGAGGAGGCCGCA 

GTACCT 

Sequence 1196 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGAGGCGGAAGCGC 

AGCGGGGGCGGGAAGGTTGTAGTGCCGCGAGTTGAGCTCCTCTTGCCTAAGTGGTCGCGC 

CCCCTTTAAGAGCAGCGATTGTAAGGAGAGGCGGTCCCGGTGTCCTCGGGTCCCAGGTGA 

TTGTGAAGTGCTGACCAATTGCCACTGGACATACTTGAAACAAAATAGGAAAATGGCAGC 

AAACCCTGTCTCTAAATCAATCAATCAAGCGAGCCAGAATGCAGTAGTGGCCTGAGAGAG 

GCATCCTGGMCGCAGTGCGGTCTGGCTAGGCTTAGMGTATTCATGTGATTTTTACCTG 

ACAAGGG 

Sequence 1197 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAMAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGG 

Sequence 1198 

ATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCG 

AGCTTCGGAGAAGCAGTGGTGGGTTCCATGTGGTGGTGGAGTAGGAGGCAGGTCTCCGCG 

GTGGTTTCCACAAGAAAAATGGCACAATGTTTCTCAGAAGACAATTACATAAGAATCAGC 
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ATACTTTAAATTCACAGCAAATAATCAGACAATTGATGAAAATACTTACCCAAACACTAA 

TTGTAGACTGTGCCTTCTGMTATGTTTTGTCATAAACTTGGAGTAAGGAATCCTCACAG 

GCACTGGACMTTCAAAAMCGTAMGTTTGTTTGTTAGMTACCTGGGTGCTTTTGGAT 

AGAMCCCTCATCCATATCCTGGTMGGCTTGMGTTGCACAGGAGTTTTCATTTGTCAA 

AACCCAGAAAACCATAAGCTTTAGATTTGGG 

Sequence 1199 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAMATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTTTGA 

GAGGGGTTTGGAGTCTGGMCCNTNATCCTTTCACNTJCAACTTGAAATGGGGAATGAA 

Sequence 1200 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAANTGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAATCAAGGGAGAANGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCNGACTTTCTAAAAATGGAGCAGANTC 

TGAGGCTTTGCATGTCTTGGCATTCTTCAAGAGCTGAATGAAAAAATGCAACANGCAGAT 

GMGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATATAGATGTGGTGCCCACTAGGNTNCTGCTGAAAGGGAGNTGAAATT 

CCTCCACCAAGTNGGTATTCAAAATA 

Sequence 1201 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACGCGGGGGTAAC 

TGAAAATCCACAAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGC 

AGAACGGCTGCCTAATTTACAGCAACCATGAGGCCACTTAAGGATGCAGCAAGAAGGAGC 

CATCTGCAATCCAGGAAGAMTTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTA 

GGACTTCTAGCCTCGAGAACTTACAAATGGTGATGATCAT 

Sequence 1202 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAANNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

NATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAA 

ATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAAG 

Sequence 1203 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATAAAATAATA 

CATATG AAGCC A CAATA TTCAATTACAGTTCAAACTAGAAATTACTAACTACATGTAGCT 

GCTTCA \ i I l ICGTT TTGCA TTTTGGCCCCTAATTCATATTTACAACTGATACCTGCTGA 

GAMAAGATCCAA ACTTT TAMCTTTGTATGTTTTGTGGAGGGTGCACAATTTCTTCTAA 

TATATCTTCAGGTGTTTTAA MTTT MTTTGTTTTTMTCATAAGATATCATCATGGCCA 

AGAGACTGGGAAMTMCMTTTTATTCTTTCTCCTMGATTGNGATTTTATTATTCCM 

GATCTTATGCTTGAATTACTTAGCAAGAAGGCATGATTATGCANAAGACAGGGAAATGAA 

GAGAAAAGAGCGGGAATATACGAAAATGAAGCTTCCTTAACAGAGTTCATGGTGGAGATG 

GTAGACACTGGTGGAG I I 1 1 1 I I CCAGACTTAA 

Sequence 1204 

TCGAGCGGCCGCCCGGGCAGGTACACTCTAAAGAAAGCCATGAGGATGATAATCCACTTT 

GATAC TTCCAATCT GCTGGTCTTGCTGAACTCTTTGGATCATGGATATCATAAGTTCGAC 

AAAATAT I fTTTl IGTAGAAGCACAAATGTGAAGNGTCACTCGTTCTGAGACTTCCTCCT 

CTGTGAAA TTCCA CAATCTCTTTCTATTTATAGACTTTTCCACAGCAAACATTAGTCTAC 

GCAGAGCATTTTGAAAATCATTTGCCAGTTCTAMGTAGTAATAATAAATACTCCAAGAA 

CTAAAAGTCCCCCTGGTAGCATTCTGGATACCTGGCAGGCATGTTCTGTGGCCCATTC 

Sequence 1205 
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' NNGNCCTTTCGAGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCC 
ACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAG 
ACAAACTGTAAAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 
TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 
CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 
GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 
Sequence 1206 

AGCGTGGTCGCGGCCGAGGTACATAAAACATTATTCCTTCCTTGGCCTAAAAACTCATCG 

CCACCTACATTAMGCTMTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGG 

CAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGNTAGCTACATAAAAATGCACCCT 

AGACCCGAMCTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTCCACGCAGTCCCTG 

GTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAAC 

TTACACI I I I I I I I MTCTTTTCCTACAACTTC ATATTTTATAAATA 

Sequence 1207 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATCTAGGTCAGTTTAAGAAGAGTCAGCTCAGA 

GAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATCAGGGAACAAAATCCTCTCCTT 

GTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTATCTTATTGCCTAAAAAAAAAA 

TTTCTTATCATTGTTTCAAAAMGCAAMTCATGGAAAATTTTTGTTGTCCAGGCAAATA 

AAAGGTCATTTTAATTTAGCTGCMTTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTC 

GCCTGATGTCA TTAAG C ACCATC CAAAAAGTCTGCTTCATAATCTATTTTCAAGACTTGG 

T GATTC TGAMGTTTTGGTTTTTGTGACTTTGTTTCTCAGGAAAAAAATATTCCTAC 

AATTTTAAGTCTATAATTCAATTTAAATATGTGTGTGTCTCATCCAGGATAGGATAGGGT 

TGTCTTCTATTTTCCATTTTACCTAT 

Sequence 1208 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCAC 

CACTTTATAGAGGGTGTAAAAATAAACCAGANATCAAGGGAGAAAGAAAAGATGAAAGAC 

AAACTGCMAAAATTGCCAMATGCGACTTTCTAAAAATGGAGCANATTCTGAGGCTTTG 

CATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCT 

GAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGA 

AGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCA 

AGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

Sequence 1209 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGNCAAAATG 

CGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTNTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTNAGCATCAAGCTGGAATGGGGAANGAAGAATAGAGATGTGGTGCCCACTA 

GGCTACTGCTGAAAGGGAGCTGAAATNTCCTTCCACCCAAGTTGGTATTTCAAAATATGT 

NATTGACTGGATANGGGCAAAAAGGATTTGGACTAAGACACTGGGC 

Sequence 1210 

GCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAACGGCCATCCACCACTTTATAG 

ACGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MAATTGCCAAMTGCGACTTTCTAAAAATGGAGCATAATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCNGATGAAGACTCTGAGAGGGG 

TTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGA 

G ATGTG G N G CCC ACT AG 

Sequence 1211 

CGGAAANTTGGGGGGCCCCCTTNCTTAAGAAAAGGCCATTGGCTTNCCGAAGGCGGGGGC 
CCCG CCCC AAGTTGGTTGGMNTGGGGGATTATTNCTTTGGCCAAGAAMNTTTCCGGGG 
GGNGTTTTNAAGGGNGGGGGGGGGGGCCCCCCCGGGCCCCCCAAAAGGGGTNAACCCCCC 
TGGGGGGNAANAGGGGGGGAAANNTTTNNNAAACCGGNNAAAAAACCCCCCCAGGGGGGG 
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CCCCCCGGGGGGGGAAAAAANCCCCCNGGGGGGGAAACCCCCCCGG 
Sequence 1212 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAAGATGGTGGTGCCCACTAGGCTACTGCTGAAAGG 

GAGCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAA 

AGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

GGGTCTTGAGAGAGCCCAAAGGAGTTCTGGGAGAGGGACCAGATGGGGGGGTA 

Sequence 1213 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

G 

Sequence 1214 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGTCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTC^AAATATGTAATGACTGGTATGGCAAAA 

GATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAAT 

Sequence 1215 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 

AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1216 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACCTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCCCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGT 

CTTGAGAGAGCCNAAGGAGTTCTGGGAGAGGGACCAGATGGGGG 

Sequence 1217 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAMGATGAAAGACAAACTGCAA 

AAMTTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTG 
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GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGTTGGTATT 

CAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGG 

CAGGGTTATG7TA 

Sequence 1218 

CCGC6GTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC 

CAGTTCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGNGTTCTCAAA 

CCCGGNATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTCAAG 

AATGTGTCTGTCAAGGATGTTCGTCGGGGCAACNTTGCTGGTGACAGCAAAAATGACCCA 

CCAATGGAAGCAGCTGGCTTCACTGCTCAGGGTGATTATC 

Sequence 1219 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGATGCTACAGACGAGGACATCACCTCACA 

CATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAA 

CGCGCCTTNTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGA 

CCAGAGTGCTGAAACCCACAGCCACAAGCAGTNCAGATTATATAAGCNGGAAAGCTTATT 

GATTANAAGCAATGNGChfTTTCCGAT^GATTGATNNGTNMGNAACTTTTTNAAA 

ANCCCTGAATTNCCNNNACCCAATTAMTTTTTNCNANCCCCTTTAAAAAATTTTNCTNG 

GGNTGGGGCCCCCCCNAAMNTTAGGGNANAAAAAATATNAAANCCCCNNAAAATATTTT 

NNNTTNTTTTCTAMAAAMTAAAAAACCCNCCCNTTTm 

GGGAAAAAAAATTCCAATTTTTNCCCTTTTCNTTTTANCCNAAAAAAAAAAAAAT 

Sequence 1220 

CCGCGGTGGCGGCCGAGGTACA7TGGCACGTCACGATGTCTTGAGTTTCATTCACTAGGT 

GGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGAGGCTGGC 

TCAGGTTTGACTCTATCTTGGAAAAAMTAGGAAMCTTCATTTATGGAATAGTTTTGAA 

TAACCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACGGGTAGAA 

TGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTCTGGAATT 

GGTCTGACCCAATTAACACACGGCCTCTGATGGGMTAGATGTATTTTGGGGACACATTT 

TMTCTGATAGCTGTAACCCCTTTTGAGTTGGCT7TTGTTCACTGGAATCCCTTTCCAGT 

CA 

Sequence 1221 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCATACC 

Sequence 1222 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCMAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAMTGCAACAGGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGG 

TTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGGACTAAGACAC 

Sequence 1223 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAA 




WO 01/070979 



PCT7US01/09126 



TABLE 1 

201/467 



TGAANGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAA 

TCMGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCT 

AAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCNTTCCTTCAGGAGCTGAATGAA 

AAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCT 

TCAGCATCAAGCTGGGAATGGGGG 

Sequence 1224 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATAC 

CACTGGACAGGGTTATGTTAACACCTGGAATTGCTGGGTCTTGAGAGAGCCCAANGGAGT 

TCTGGGGAGAGGGACCAGATTGGGG 

Sequence 1225 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAACAGAA 

GATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGGTTGGG 

TATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGGACTAAAGACACTGGCCATAC ■ 

CACTGGACAGGGTTTATGTTAACACCTGAANTGCTGGGGTCTTGAGAGAGCCCAANGAGT 

TTNGGAGAGGGCCAGATGGGGGGGTAG 

Sequence 1226 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTT 

AACACCTGAATTGCTGG 

Sequence 1227 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCAGGAAGTGAAGTAAAA 

CCTGGTCTTGGTTGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGA 

GCCACAGTATMTTAGTAGGCTGAGAAGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTG 

CTCGCCTGGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAGTTGCTCATC 

AGCTTCTGGAMGCAGCTTCACCTGTCCTTTTCCCCAAGAAGCTGGGCAGCTCCCGGGTC 

AGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGMCTTGTCACCCTCTTTGCCCGAGTAC 

CTGCCCG 

Sequence 1228 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAMTTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGTC 
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TTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAAATGGGGGG 
Sequence 1229 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATNAACCAGAAATCAAGGGNGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACT 

Sequence 1230 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTCTTC 

TAGTCCGGTAAACAGAGGGCCTGCCCCCGACAGCTTCTGCTTCCGGGTCACGCCTTGACA 

GCGGCTTTCAACCCCCACCTCAGCCCAGCAATTCGTTTGGAGCATGTGAACACCTTGAGC 

CTTGATGAGTTCCAGThrTGTGGTATATTATGCAGNGCNTTCAGNGAAAATNCTTTTTNTN 

CGGGNNTTNAANNAAAAANANTNGGGTGCCATGNTNTTNCCX3CCNNNNNNTNGGGGGGGG 

GCCCCCCTCAAANGGGGGGGGGGNACTATNANNNNCCCTNTTTTTTGGGGNNCNANNTNN 

ACNC^TTTN^NGGGCCCCNTTTTTTGGGGGNAAAAAAACCCCCCCCCTNNGGGGGGG 

GTATTTTCNTTTTNNGAAAAAAAAAGGCCCCGGGGNNGACCCCCCCCCNGGTGGGNTTAN 

ANAAAAAAAAAAAANTCNCCCNNTT N 1 1 1 1 1 1 1 1 1 1 I AAAA 

Sequence 1231 

AGGTACGCGGGGCTTTTCCGT GCTAC CTGCAGAGGGGTCCATACGGCGTTGTTCTGGATT 

CCCGTCGTAACTTAAAGGGAAATTTTCACAATGTCCGGAGCCCTTGATGTCCTGCAAATG 

AAGGAGGAGGATGTCCTTAAGTTCCTTGCAGCAGGAACCCACTTAGGTGGCACCAATCTT 

GACTTCCAGATGGAACAGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAG 

TTGCTCATCAGCTTCTGGAAAGCAGCTTCATCTGTCCTTTTCCCCAAGAAGCTGGGCAGC 

TCCCGGGTCAGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGAACTTGTCACCCTCTTTG 

CCCGAGTACCTGCCCG 

Sequence 1232 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAA 

CAAAAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCATTTATTGGTGGCAGA 

AGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAAT 

TGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCAACTA 

AAATGCCATTTCACC 

Sequence 1233 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAANTGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGNGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAG 

GCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAA 

GACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTNATCCCTTCAGCATCAAGCTGGAATGGG 

GAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNCT 

CCCCAAGNTTGGTATTCAAAATATGTAATGACTGMTATTGGCAAAAA 

Sequence 1234 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTNCCCAAGTTNGGTATTCAAATATGTAATTGCTGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCCTACCACTGGACAGGGGTTATTNTTTAACCCCCTGAATTTGCTTGGGT 
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CTTTGAGAGAGCCCCAAGGGGGTTTTGGGAGAGGGGACCCANAA7TGGGGGGTAGGTC 
Sequence 1235 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCMAAAATTGCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTTGAAAGG 

Sequence 1236 

TGGA CTCCACCGCGGTGGCGGCCGAGGNACAATACACTAGAAACCAACATAATGTATTTT 

TTTTAAAACCTGTGNGAAAAAATAAATGTTCCACNAGTAGGGATAGGGGAAAAGNAACCA 

AAAGAGAGAAAGAGAAAGGAATGCTGGTTNATCTTTGTANGTNGTAATCGAATGGAGAAA 

TTTGCAGTATTTTANCCACTATTAGNGAAA I I I I I I I I I I I 1 1 1 GTCAAAATGANAGACT 

GGAACCTCTGTTCMNATGCTTTNATTGNMCTCTGGTTTTGAAGACCGGGNNNGGNAAA 

GCAANNAAAAACGTNGGGAAACCCTNNGATGGACNTAAAGGGGCNNTGGNNGCCAAAGGG 

ACCTTGGGGGAAAANGGTCCACTTTGMTANANAAGCATGGGGNGGGGNGNATTTTTCCC 

CCCCCTTTTAAAAAANATGGTNTGGMTAATTTTAAANNGGNATATTAAMACCACCTTT 

NTT 

Sequence 1237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGATATCCACATAT 

TTTCGATAAAAATTCCAAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTA 

TACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTC 

TAAMCAGCACAGCAGCTMTGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTT 

CACACGCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGT 

CCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCC 

TCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAG 

CGCCGAGGCTTTGGC 

Sequence 1238 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACNNTGGCQATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGMATTCCTCCACCAAGTTGGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

T 

Sequence 1239 

AGCTCCACCGCGGTGGCGGCCGAGGTACGCGGGGGGCAGMGAGGAAGATTTTTGAAGAG 
TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGTGA 
GAACAATAAGAATTCCTTGGAGAGCAGCCTACGGGNAACTAAAATGCCATTTCACCTGGA 
ACTTGATGGANGGGAGAAMCTCCTTGGMTGATTTTGMGAC 

CGGCACTGAAGATTTNCAGCAATCCGTTGGAATNTCAAAAGGCCCACCAAANGGTGCCAA 

CNCCTACATGNGCNCTTATCNTAAANAGGCACCCCTCCAANAGGGGGNCNAATAAACGNA 

G GGNC NAAGNNNCCCTGGGNAAATTGGCCTTTACNGGNTAAAAAAGGCCTTGGANNGGAG 

GTTTTAAANTNCCTACGGCCAAAGGNAAGCCATTGGCNTGNACCCCAAGGGCCAAGGAAA 

AATCCANNTAAAAAGNTTCTTGGGNGTCCACCCCNTGGGG 

Sequence 1240 

AGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 
TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 
AGGCTTTGCATGTCTTGGCATTTCCTTTCAGGGAGCCTGAATTGAAAAAAA 
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Sequence 1241 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCNGGTACGCGGGGGAGACATTCCTCAATTGCTT 

AGACATATTCTGAGCCTACAGCAGAGGAACCTCCAGTCTCAGCACCATGAATCAAACTGC 

CATTCTGATTTGCTGCCTTATCTTTCTGACTCTAAGTGGCATTCAAGGAGTACGGGAAGG 

CGMGAAMGMTAGAGAAGATAGGGAMTTAGMGATAAMACATACTTTTAGAAGAAA 

AAAGATAAATTTAAACCTGAAAAGTAGGAAGCAGAAGAAAAAAGACAAGCTAGGAAACAA 

AAAAGCTTAAGGGGCAAAAATTGTACCTTCGGCCCGCTCTAGAACTAGTGGGATCCCCCG 

GGCTGCAG 

Sequence 1242 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACATGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAG 

Sequence 1243 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAT 

CAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGG 

Sequence 1244 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCCTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAA 

Sequence 1245 

TGGTACTGCTAAAGTCATGACAGCCCAACAGGTGATGTTTTACTGGATGAAACTCTGAAA 

CACATCAAAGCAACTGAACCCACAGAAACTGTCCAAACATGGATAGAGCTACTCACTGGT 

GAGACCTGGAACCCCTTCAAATTACAGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGT 

TCTCAAACCCGGTATGGNGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATC 

TGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAA 

TGACAAGAATGTGTCTGTCAAGGATGTTCCGTCGTGGCAACCGTTNCTGGTGACAGCAAA 

AAATGACCCCACCAA 

Sequence 1246 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTG 

Sequence 1247 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGQATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGMAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 
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GGACTAAGACACTGGCCATACCACTGGACAGGGTTTATTGTTAACACCTGAATTGCTGGG 
GTC 

Sequence 1248 

GAGCT CCCCGCGGTGGCGGCCGNGGTACATAAAACATTATTCCTTCCTTGGGCTAAAAAC 
TTTTTGCCACCTACATTAMGCTMTA^ 

TTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACATAAAAATG 

CACCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTNTTTAAGGTGTCCACGCAG 

NCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATNAGCCCTACCCCAGTAATCCCCAGA 

AGGAACTTACACI I I I I I 1 1 MTCTTTTCCTACAACTNCATATTTTATAAATAAAAAGAC 

ANAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCAT 

Sequence 1249 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGTGMGGAGCMGGCTGMGMCCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGMGGAGCCMCCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCMTACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTAT 

GTTTAACGCCATCTTACTAACCATTATTGCTATGGTGGTATACACTGCCTATGTNTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTT . 

Sequence 1250 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATNTGTAATGACTGGTATTGGCAAAA 

Sequence 1251 

CTNCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATGGAAGAATNGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

GAGCTTGAAATTCCTCCACCAAGGTTTGGTATTC 

Sequence 1252 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCTTGAGAAGGATTT 

GAAGGACAAGTTTGTGGCCCTGACCATAGATGATATCTGCTTCTCGCTCAACGACAACTC 

ACCAAACATCAGATATTCTGAGAACGCCGTGAGGATTGAGCCAAACTCCGTGAGTCTGGA 

AGACTGGTTGGACTTCTCCAGCACCAATGTGGAGAAGGCTGACAAGCAGCGGAACAACTC 

CCTGATGCTGAAAGCCCTGGTGGATCGAATCCTGTCCCAGACAGCCAATGGATCTGTGCA 

AGCCAGTGTGATTGTGGTGGACACCGGCATTCAAGAATGGGCCTGAAGGGATCAAAGGGA 

TGCCAGGGACAAGCTGGGCTTGATCATCTGGCCCAAGGTATTNGGAAAGAGATTGCTTCC 

CAGGGAAGAAAA 

Sequence 1253 

ACTNAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTNGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAANGGNAGATTGGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 
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CTGAAATTCCTCCCCCAAGGTTGGTATTCAAAAATATTGTAATGAACTGGGTATTGGCAA 
Sequence 1254 

CCGCGGTGGCGGCCGNGGTACAATGATTGTCATCTCAGTAAAAGGTCTATTATCTAACTT 

GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAMGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGTCTTGA 

Sequence 1255 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGG 

GGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGCTCTGGGCAGCAGCAAGAAT 

TCCTCTGCCTCCCATCCTACCATTCACTGTCTTGCCGGCAGCCAGCTGAGAGCAATGGGA 

AATGGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCANAAGCTGGGTTGGTTTATCCAGGAA 

TACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAGATGGTGAACACXJATCTGG 

GACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGNGCANGGAGTGGCCATAGGNGGCTCC 
TATGGACGGAAAAC 

Sequence 1256 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAANGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA • 

CAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

Sequence 1257 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAMTAMCCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

GATGAAGACTCTGAGAGGGGTTTGGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 
AAA 

Sequence 1258 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAAATATGTAATGACTTGGTATG 

GCAAAAAGATTGGGACTAAAG 

Sequence 1259 

TACTTAGG GCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGTTT 

TGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAATACATAAATACTAGTCCACGTTA 

ACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTG 

GAAATACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAGGA 

GATGGAGAGCAATTCTTTAAAGATAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAA 

GCCTCTTAAAATGGGACGCATTTGCCTCGCGCCTCTGGGGTGTCTGCAGCTCAGCNTTGG 

TGCCCCACACGGGACACCCGACTTTT 

Sequence 1260 
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TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCAATGTCATT 
MCCATAAA TGAG CATTTACAATCTGGATTAAATGTCACATGGTATTAAGTCTACACTTA 
GAGTMTGCTTTTA CTGATTTTTAAA AATATATGCATATGTTTAGTGATCGAGAAAAGTG 
AAATACTGGAGTAC 1 1 1 I I 1 1 I I I 1 1 I 1 1 1 1 1 1 1 1 GGCTTGATGAGTAGGTGAGTTTATT 

GGGACTTACACACAGGTCAATCCTGGGCGGCGACAAGACAGCTCTAGAGATCTGAGCTTC 
CTCCCMTGCTAMCTGCTTTCATGCTAATTTTCTGACTGTTTACTTACCCGGGGTAAGA 
GCGATGGGGACTGTTTTCATTGG 
Sequence 1261 

TNCTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACTAGGCTACTGCTGAAA 

GGGAGCTGAAATTCCTCCACCAAGGTTGGGTATTCAAAATATGTAATGACTGGTATGGCA 

AAAGATTGGG 

Sequence 1262 

TGGCGGCCGCCGGCAGGTACGCGGNGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCC 

TGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGNGAGAACAATAAG 

AATTCCTTGGAGAGCAG CTACG GCAACTAAAATGCTTTCACCTGGAACTTGATGGGAGGG 

AGAAAACTCCTTGGATGATTTTGAAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCG 

TGAATTCAAAGCCACAATGTGCAACCTACTGCCTATCTAAAGCACCTCAAAGGGCAAAAC 

GAGGCAGCCCTGGAATGCTTACGTAAAGCTGAAGAGTTAATCCAGCANGAGCATGCTGAC 

AAGGCAGAAATCAAAAGTCTGGTCACCTGGGGAAA 

Sequence 1263 

cgaggtacccaggcctgcaaatctcctggcaggatggtcaggaactgctctagcaccagc 

agttccagaatctgttccttggtatggatctctggcttcagccactgacagcagagctcc 

cggaggcggctcagtgcctcgcgaggtccaggagaatcctggtagcagaactgcctaaaa 

agctgcctgcagagctcctgcttaaggagttcacttctctgtaagcaagtgtcctgccca 

tggataaattcttcctcttctatcttcactatcagaagtccttcatcctctggagagttc 

tgggctgcagctttctttggttctgtagccatctctcggacagggctgattccgatcgga 

cacttccggtggaaggactgagcggcgctacacttcaagaattccgtccacagggacttg 

tgagtctgcgcagaa ggcg ggatgcctttggactacgattccaagaatccttcctgggtc 

tcttcgggcgcagact7ttcgccaaagtcctgaagatctcagggcttgaaggagggggca 

tccttcttcttcattgnagtagtgtgtcttgctaaataacagaagggactcctgaaaaga 

aaatgacgttggcccgggcgcgggggcttacgcctgcaagtccacactttgggaggccga 

acnggcngatcacgaaggtcaggagatcgangactatctgggttacgcggngacaccctg 

NGTTTCTTAAAATCCANAAAAAAAAAAAAAAAA 
Sequence 1264 

GGCGAATTGGACTCCACCGCGGTGGCGGCCCGAGGTACAGAGATTTATAATGTGCTGCTC 

TAGGTCCTATCGGGTA AAGG GATCAGCAGATGTGAAGTCAAGAGTCTCCTGTAAGATTTG 

A CmCT TGGAMCATATTTTMTCCTGGGCCTCCTmTTCMATCACCTAmCTTTTA 

Gn 1 1 1 iGCAGTGATACTGTGTGTTGCTTCTAACAGAGGTTCAGTTTCACAGCCTTTCCC 

TCMGTGTCTTATCCTAAAAGTAAAACCTAGATGATCTAAGGTGGTGGNTTTCAACAGGG 

TGCAAATTTGCCTCCTATACTCGCAACACCCAGNGACAGTTGGCTATGTCTNGGAGACAT 

TTTTGNGTTNTCACACCTGGANNNAGGGTGGGGGAGGTGGNGCTAATGACANCAAGNTGG 

CCNTAANCCAATNATGCTGATANAAATNCTACANTGCACAAGGATAGGNTCCCACANAAC 

ANAAGNCTTANCCAAACCCCAAATACTAACAAT 

Sequence 1265 

CCGCGGTGGCGGCCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTT 
GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 
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TGMMGCMTTAGAGTMGCMGATTAGTTTTTCCTCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGG 

CTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACC 

AACTTGCCCAGGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATC 

TTCTCCATACCAGCTTCCTTAAGCAAAAGCAATCCTCTCTTTGCTTCCTCAAGGGGCAGC 

ACMAGGATGTTTTGGCTGTGTGGAAACAGAAGCCCGCATTTGTAGTTGCACTGGCGAGT 

GAAGTGATAGTTGACGCTGGTTGGGGTGGT 

Sequence 1266 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACTTGCTAACTTTGACGCCAGCATCTCTGAAAG 

ATCCCCATCGMGGCCGGTCATTGCAMTACAGGCTGTTCCTTTTCACCCTTGATCTGCA 

AGACATCAAGTGGAACTGTCTCTCCTTTCACMTGGCAAGTGTGGCATCAGTAATATGTT 

GGACTTTGTTTCCACTTTCGGCAAAGAGGGTATGACTCAAACTACTGGTCTCTCCCAGTG 

GGATAAATCCAATGGGAATCTTACTGAAGGTAGCCTCATCTGTTCGTCGAAGAACACCAA 

GTAACAACCTNCTGCAGTGTCCCATCTCCTTCCTGNAACAATGGATCACATTCCGGNGTT 

TTO:ATCAGTTNCAAGGAGGTTTNTTNGGCTTGGGCCCTTAANAAATCTGNGC7TAAACA 

AAAAGGCCCNATTCCTTCCCAAATAAAAANGGAAAAANTCGGGGGCCNCAA l I I I I II I 

Sequence 1267 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAAT AAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGGACAGG 

Sequence 1268 

GGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAA 

ACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGC 

GACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 
CACCTGAATT 

Sequence 1269 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTACAG 

AGATCCTGAAGAGATTGAAAAAGAAGAGCAGGCTGCTGCTGAGAAGGCAGTGACCAAGGA 

GGAATTTCAGGGTGAATGGACTGCTCCCGCTCCTGAGTTCACTGCTACTCAGCCTGAGGT 

TGCAGACTGGTCTGAAGGTGTACTCTTGGTTTATCAATGGGACGTTCCAGCAATCCACAC 

AAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGCGGATCCTATATGTGCCAAGCCC 

ATAACTCAGCCACTGGCCTCAATAGGACCACAGTCACGATGATCACAGTCTCTGGAAGTG 

CTCCTGTCCTCTCAAGCTGTGGCCACCGTCGGCATCACGATTG 

Sequence 1270 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTMAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGMGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGA 



m 



WO 01/070979 



PCT/US01/09126 



TABLE 1 
209/467 



Sequence 1271 

ACACTACTT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAT 

TTTTATTTTGAAMCCATTCAGCACATTTATCTTATGTAACATGCAGAGCATATATCTAT 

CTGTATTTTTAAMTTTTCCT^ 

AGTCCGTGAGAAACATGGCGACTCGATCAATGCCCATGCCCCAGCCAGCTGTGGGGGGCA 
GCCCATATTCCAGGGCAGTACTCAMGGTGATATTTGCTTTTTTCAATGCTTCAGGGGAA 
AAATCCTTTTCTTTA 
Sequence 1272 

TAAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAANGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAMGACAAACTGCMAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATT 

Sequence 1273 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGGTGACGTGGTTCCCAAAGATGTC 

AATGCTGCCATTGCCACCATCAAAACCAAGCGCAGCATCCAGTTTGTGGATTGGTGCCCC 

ACTGGCTTCAAGGTTGGCATCAACTACCAGCCTCCCACTGTGGTGCCTGGTGGAGACCTG 

GCCAAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAA 

AATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAA 

MTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

Sequence 1274 

ATCCCCGGCGNGGCNNNNCGCCCGTTCAGGTACTCTTTGTTTTGGCACACTTTTCCTGAC 

AAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGTTAACAACAATAGCATATGAG 

ACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTGGAAATACX)TTGGAGGGTT 

TCACAAAAATAAGACAMGGGCAAAGGAACTTTGCCAAAGGAGATGGAGAGCAATTCTTT 

AAAGTTAGTGQGAGGGAGGAAGCAAAGAGCTCATAAATACAAGCCTCTTAAAATGGGACG 

CA7TTGCCTCGCGCCTACTGGGTGTCTGCAGCTCAGCTTGGTGCCCCACACAGGACACCG 

ACTTTAAGTGGCTGCCTTTGCAAGGCTGAGAGGCCATGGAGGGGTTGATGCCTGMGTGT 

CAGCGCCATCTAGTGGAAACATGGGGCATGGCCC 

Sequence 1275 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTNCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1276 

AGGTACMAATTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGAATTTT 

TGTMTATGCTGTTTCACMTCTTTTTATTAMTTACTTAATGATTCCAGTCTGCAAAAT 

GAGCCATMGACTTTGCTNCTGTTTGNTATANGATNCATNTGGANATTGGGGGNGGGNAA 

ACCATANGTAAGGTTAAACCTATCCGTCACCTGCTTCATGTAAAAGACTCCACCATTTGN 

TTGGATNTATTTTTTTCTCCAGGCNACTAGTAAGMAAMGGTGAACAAAGGTGGATTTN 

CAT CCCT NNCMANTGGGCCCTTNTGGCNCMTTCTTTTTTCANTAATCCTATGGTANAC 

CNNTTTTGGTAGATTACCTTGGTGGTN^^ 

AAAAATTCTTTGGGGATTTAAAATTTAAANCAAAAAAACCAAACCAAATAAAANTTCTAA 
TCCACCCNTGNGGAAAATTATTTTGGAMAGGAAAAATTTTCAGGTTAAAAAACAAAGGG 
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ANTGGNTGGTTCCTTCCATTAGGTTTAAAGGGGAGGAGGNANNATTAAAAAAAATTAAAA 
AAATTG GTTC ATTTTAAAAC AAG GTTT N G AAATTT N AAG G G AA 
Sequence 1277 

AGGTACAAMTTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGMTTTTC 

TGTAATATGCTGTTTCACMTCTTTTTATTAMTTACTTMTGATTCCAGTNTGGCAM 

GAGCCATAGACTTTTGCTCTGCTTGTTATAAGATCANTTGGAATTGGGNGGGGGGGANAA 

CCANTAGTTMGNCTAAATCTAATCCGGTCACTTGGCTTCATTGTAAAAGAACCCCACAA 

TTGGTCTGMTTAATTTTTTGCCCAGGC^^ 

GAANTACATTCCTTGGCAAANTGGGGCCCTCTTTGGCCCAAATCCI 1 1 I I I CC AATTATC 

CTTATTGGTTAAAACCCTTTTmGGTTAAGTNTNACCTTGGGTGGTTGGMT 

GCCGGNCTTGGNNGGCCTTMTTTTC^ 

MCC CAANMNAACCANAC CAMN AAATANTTCNyTNATTNCACCCCTTGGGGNAAATTNA 
TTTTGGGAAAAAGGAAAANATTTTCNMGTTTAAAAAACCCAAAGGMNTGGGTNGTTCC 
TCCAATTANGGTNTTAAAAGGG 
Sequence 1278 

AGGTACATTTACACAATATTAACACTAAAAATCTGTGTTTTTTTAAAACACCATAGAAGT 

CAMCCACMAAACCCAGGATCTTGTTTTAAATGTGTTTATGAAGACTGCTGCTGAGCTC 

AAAAGCATTGCAGGTAATCATGACCACCTAGATGAAAGCTGGATGTTTGAAAACTCCTTC 

ATGTCCAATGAAT GTAAATTTTTTC ACCTCATCCCCMGGTATTCTCCCATACTTTGTTC 

TACTTTTG ACCTTC 1 1 1 I I SI I I I I GGGNCACCTCTTTTCATGGCATAAGGGCCTNGACT 

TGAGGGGGTACAGGTTCTTTTTNGTGGTrrTAAAAGGAATTACTTTTCATTM 

CTCCTTGGTTTCCTTTAATTTCCCTTTAAA 

TTTTTCNTTTTTAAGNGGACC 

Sequence 1279 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAACGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAG 

AAMGATGAMGACAAACTGCAAAAMTTGCCAAMTGCGACTTTCTAAAAATGGAGCAG 

ATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAG 

CAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 

AAATTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1280 

GTGGCGGCCGAGGTACCAANTGAAGTGTGAATGACAATGGCCATCCANTANTTTATAGAG 

GGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAA 

ATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTTGCATGTCTTGGC 

ATTCCTTCAGGAGCTGA 

Sequence 1281 

CCGCGGTGGCGGCCGCTCGGGCAGGTACCATTCCTCTACATCCATTTGGTAGCAGAACCT 

CAAGTGTAAGCAGTCAGTGTAGCATGMTATGMCTGGCTCAGTTTATCACTTCCTGTTT 

NGACCTGAAGCACCACCCAGCTATGCAGAAGTGGTAACAGAGGAACAAAGGCGGAACAAT 

CTTGCACCAGTGAGTGCTTGTGATGACTTTGAGAGAGCCCTTCAAGGACCACTGTTTGCA 

TATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTATTCAGAGATTGATCCAAATCCT 

GATCAGTCAGCAGATGATAGACCATCCTGCCCCTTTTGTTGAAGGAACACTTGGTTGA 

Sequence 1282 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCCTTCCTTGGCC 

TAAMACTCATCGCCACCTACATTMAGCTMTATGCCTGATTACTGTTTTTAGAGAACT 

TATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACAT 

MMATGCACCCTAGACCCGAAACTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTC 

CACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAAT 

CCCCAGAAGGAACTTACAC I I I I I I I I AATCTTTTCCTACAACTTCATATTTTATAAATA 

AAAAGACAAAAATGTCAGGCCTGTG 
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Sequence 1283 

GGTGGCGGCGCCCGNCAGGTACCAAATGAAGTGTGAAGACCNGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCATGGGAGAAAGAAAAGATNAAAGACAAACTGCA 

MAMTTGCCAAMTGCGACTTTCTAAAAATGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACANGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCANCATCAANCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGNAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

GACAGGNTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCNAGGAGTTCTGGGA 

NAGGNACCACATTGGGG 

Sequence 1284 

CGCGGTGGCGGCCGCCCGGGCAGGTACCCCGGGAGAGCCCGCTTCCCCCTCCTCCCTGTG 

CTGTCTGCACCCGAGGAGAGCGGCCTGCCCGGAAGTGGGCCACCATATCTGGAAACTACA 

GTCTAT GCTT TGAAGCGCAAAAGGGAATAAACATTAAAGACTCCCCCGGGGACCTGGAGG 

ATGGACTTTTCCATGGTGGCCGGAGCAGCAGCTTACAATGAAAAATCAGAGACTGGTGCT 

CTTGGAGAAAACTATAGTTGGCAAATTCCCATTAACCACAATGACTTCAAAATTTTAAAA 

AATAATGAGCGTCAGCTGTGTGAAGTCCTCCAGAATAAGTTTGGCTGTATCTCTACCCTG 

GTCTCTCCAGTTCAGGAAGGCAACAAGCAAATCTCTGCAAGTGTTCAAAAAAATGCTGAC 

TCCTAGGATAGAGTTATCAAGTCTGGAAAGATGACCTCACCACACATGCTGTTGATGCTG 

TGGTGAATGCACCAATGAAGATCTTCTTGCATGGGGGAGGCCTGGCCCTGGCCCTGG 
Sequence 1285 

CGCCGGGCAGGTACCAAATGAAGTGTGAAGACANGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAlGAAAGACAAACTGCAAAAAATTG 

CCAMATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACfGGCCATACCACTGGACAGG 
Sequence 1286 

TCGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGATATA 

CTGGAGTCACAGAGCAATATGCATTAACAGGATACAACAGTTCATAAAAACTGAGTAACT 

ATGCACACAAATTTCTTAAACAGCCACCTAAAGAGAAAATGCACAGATGTATGGTGGAAA 

CTGTATCTAACACTGAAACTACTACAGGACTCCATCAATGAGTCCAACTTTTAGTGATAA 

AAAACTACTGTACACTACATGAAGAACCATATGTTTATAATTATCCAAATAAAAATGAAG 

TTATTAAACTTCAAGATMTATGGTAATTTGCATTGAACCGATGATTFTACAAAATTCTG 

CAAAGGTCAAMTTTTAAAAGATGGCTGMCAGTAA7TGCAGCATCTAATAAAAACGCAG 

CTCATTACCGAGCMACGGTTTTAATTAAAAATTCAAAAGGAATAATCCTGACAGGAGAA 

ATAAAAAAATAGATGTCAAAAGAAGATAAAATTATTTTCAAAGGAGTAGTAACTCAAGTT 

TTAACACC 

Sequence 1287 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATA 

CCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTT 

CTGGGAGAGGGACCAGATTGGGGGGTAGGT 

Sequence 1288 

CCGGGCAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTG 
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TTCACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAA 

TTAGAGTMGCMGATAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGC 

CCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGAATG 

CCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGGCTTCCATTGC 

TCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACCAAC7TGTCCA 

GGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATCTTTCTCCATA 

CCAG CTTCC TTAAAGCAAAAGCMTCCTCTCTTTGCTTNCTCAAGGGGCAAGCACAAAGG 

GATGTTTTTGGCTGTGTGGAAACAGAAAGCCCGCATTTGTAGTTTGCACTGGCCAGTGAA 

GTGATAGTTGACCCTGGTTGGGGTGGGGG 

Sequence 1289 

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CAMGTATTATGACTTTAAAAACCCCATTATTGAAAAGTACACAGGAGGCAAAGTGTTTC 

ACATCATAGACTTCACTTCCAACTCCTTGGMTGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGCTTAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCAAGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCATTTCTGGACGCTCAGGCCGTGTCGGGCTTNAAGTCA 

TCTTNCACCACACAGGTACCTTNGGGCCGGNTCTAGNAACTAGTGGGATCCCCCCGGGCT 

GGCAGGAAATTCGAATATCAAAGCTTTATCGATACCCGTCCGACCTTCGANGGGG 

Sequence 1290 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGTTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCGACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 

CACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTG 

Sequence 1291 

AGGTCATAAMCATTATTCCTTCCTTGGCCTAAAAACTCATCGCCACCTACATTAAAGCT 

MTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGGCAGTTCCAAGCTCAAAA 

ATCGCTAACTGGCACCTTGTNGTACATAAAAATGCACCCTAGACCCGAAACTTACTAGAC 

TCATTATAAAATTTTCTTTAAGGTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCG 

GAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAACTTACAC I MINI lAATCTT 

TTCTACAACTTCATATTTTATAAATAAAAAGACAAAAATGTCGGGCCTGTGAGCTGAAGC 

TTAGCCATTGTAACCCCTGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAG 

AAGTCTGGAGCAGNCGAAAAACCACAAAGAAGTGAAACAGCCAGGTTTCTGNCTTAACTA 

ATTAACCCAC 

Sequence 1292 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCC 

TTCCTTGGCCT AAAAA CTCATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTT 

TTAGAGMCTTATnTATTAGGGCAGTTCCMGCTCAAAAATACGCTACTGGCACCTTGT 

TAGCTACATAAAAATGCACCCTAGACCCGAAACTTACTAGACTCATTATAAMTTTTNTT 

TAAGCTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTAC 

CCCAGTMTCCCCAGAAGGAACTTACACTTTTTTTTAATCTTTTCCTACMCTTCATAT^ 

TTATAAATAAAAAGACAAAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCC 

TGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAGAAGTNTGGAGCAGCCGA 

MMCCACAAAGAAGTGAMCAGCCAGTTTCTGCCTTAACTAATTAACCCACCTTACGAC 

ATTCCACCATTATGACTTTGTCCACCATTATGACTTGTTCCTGGCCTGCCCCAACTG 

Sequence 1293 
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CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAMTGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCC 

ACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGT 

AATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCAGTGGACAGGGTTAT 

GTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGA 

TTGGGGGGTAG 

Sequence 1294 

CGAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTG 

CTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTG 

CCCAACTGGGTTTCCCAAGCTATGTAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGG 

GCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGC 

ATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCA 

GTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTC 

CTTGCTCTCGCAAGTGGCCCTGGCA 

Sequence 1295 

CGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCTTGATCCCCAGAAC 

TGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCTCAACAACCTCAGC 

CAGAGTGTCAATTTCTGTGAGAGAACAMGATTTGGGGCACTTTCAAAATTAATGAAAGG 

TTTACAAATGACCTTTTGMTTCATCTTCTGCTATATACTCCAAATATGCAAATGGAATT 

.GMATTCAACTTAAAAAAGCATATGAMGAATTCAAGGTTTTGAGTCGGTTCAGGTCACC 

CAATTTCGAAATGGAAGCATCGTTGCTGGGTATGAAGTTGTTGGCTCCAGCAGTGCATCT 

GAACTGCTGTCAGCCATTGAACATGTTGCCGAGAAGGCTAAGACAGCCCTTCACAAGCTG 

TTTCCATTAGAMGACGGCTCTTTCAGAGTGTTCGGAAAAGCCCAGTGTAATGACATTGT 

CTTTGGATTTGGGT 

Sequence 1296 

CGAGGTACAGGAGCAACCTTCTTTCCACCATTACTGGGMTTCCACCACTATTTGCTCCC 

CCAGCCCAGAATCATGATTCTTCTTCATTCCATTCAAGGACTTCGGGAAAAAGTAATCGA 

AATGGTCCCGAAAAAGGTGTAAATGGGTCAATAAATGGAAGTAATACATCATCTGTAATT 

GGTAT CAACAC ATCTGT AC I II I II I I II II I I I U I II I ATCTAAAAGCAACATAATTA 

TTTTTCTTGCGATTTTTTCMGAACTCTTTTTM 

AGCTTCTGATT 

Sequence 1297 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCMAATGCGACTTTCTMAMTGANAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGMGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAMTA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCA 

Sequence 1298 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

AACTGCAMAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGC 

ATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTG 

AGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAA 

GAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAA 

GTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCA 

TACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAG 
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TTCTGGGGAGAGGGACCAGATT 
Sequence 1299 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACC AGAAA TCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCAAAATGCGACTTTTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCC 

TTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGT 

CTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAAT 

ATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1300 

CGNCCGCCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTG 

CCAMATGCGACTTTCTAAAAATGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATANAAGATGTGNT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGG 

Sequence 1301 

CNMTTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAGTATGGCTTAAAAGGCTCTGCCTT 

AGATTCTAGAATCCAGMCATTTTCTCCAAAGACAATCAGGGTATGGGGGAGAAGTTAGT 

TCCA GAGAAGA GAGCGAGTCCAGGGTAGAAGGGATTCTTCTCTCCTGAGGGTCTATGGTC 

TCCCAI 1 1 1 I IAAAGCAGCAGNGGTATCTATCCCACTCATGGCCTAGAGGTTGCACAGAG 

CTGTCTGGCACCCGCTTCTTTGGCTTTTCTCTCCTGACACCCAGCAATGCTTACTCAGAG 

CGTTGAAGGCGGCCAGCACCTCGAAAGAGATTCTCTGATTTTTTGTGAACACCTGGATGG 

TGMCCCATCMGGGACTTCTGGATCTCGAMTTGTTTTTCAACCCTTCGTGAACAGACA 

GAACTTCAGCTTATCCC 

Sequence 1302 

CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGC^CTTTGCATGTCTTGGCATTCCTT 

CAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCT 

GGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGC 

CCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATAT 
GTAATGACT 

Sequence 1303 

CCGGGCAGGTACTACCATGCCGGGCCAA I I I I I M I I I GTTTGTAGAAATGAGGTCTTGC 
TATGTTGCCCAGGCTGGTCTCAAACTCCTGGTCTCAAGCGATCCCCCCGCCTCAGCCTCC 
CGAAG TGCTGGGATAAAAGGCGTGAACCACCATACCCAGCCAGTATTATCTTTTCATTTC 
ATTTTCCAGTTGAGTATATTATTGGCTACATTTGCATACCGC^ 

MMCCAATATTTTGTTTTGTTCTGTTGTCTACAATAAGGAGAA7TCAGATGATAAACTT 

ACAACCAATCATGGCCAAGTCCACTTGAGGAATTGTCTCTGTAGATTTATCTGTAGACTC 
CCTAATA 

Sequence 1304 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGAGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGA 

GCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAG 

CCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATG 

Sequence 1305 

AGGTATTCGACCCACGCGCCCGTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAAC 
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CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAGTnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnGCTGTTTCCTGNGTAGMNATTGTTATCnnnnnnnnnnnnnnnnn 

nnnnnnnnnNAGCCCGGGGAGNCATAAAAGTGTAAAAGCCTGGGGGTGCCTTAATGAAGT 

GGAGCTMCCTCCACATTTAATTTGCGTTTGCnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnCTGTCGGTGGCCAGCTGCATTAATGAAATCGGGCCAAACGCCGCGGGGGAAAGAAGG 

CCGGTTTGCGTTATTNGGGCCGCTCTTCCCGCTTTCCTCGCTCACTGGACTCCGCTGNCG 

CTCGGGTCNTTACCNGCTTGCGGGCCGAAGCCGGGTAATTCAAGCTTCAAC 

Sequence 1306 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTtCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTG 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTT 

CCAGGAAATCCCACCCTGGGATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCAC 

CTAATGAGTGCTCTAGAGATCAGCACCAAAGATGCTTGCCAGAGTCTTCTCTATATGTCC 

TCCCTCCTGTATCAAATAAGTCACCAAGTATAGCTGACTATCTCCATAACATCTCTCCAG 

TTGTGGCC7TAGTTCCAGCCTTCATCACTCTGCATTCTGAGGAGCTGATATCTGTCTCCT 

GCTGCCAGCCCCTCACCCCCAGTCGGTCTCCAACATAC I 1 1 1 1 I I I I I I I I i I I CGGACN 

CGTGGGTCGAAAGCCTTGACCCTGCC 

Sequence 1307 

AC I I I 1 1 I I I I II I 1 1 1 I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCA 

CCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCCCCTATTATTTTCCTAC 

AAGTTGCTTTTTACTTGAGTAGGAACCCTTGATTGATTTTTGCGGACGCGTGGGTCGAAG 

CTTGACCT 

Sequence 1308 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCAC I II I I I I I I I 

I! I I I II I I I 1 1 I I I I GCCTTCAACAAAAAAGGAATTTATTGGCTCACATAAGNGAAAGC 

TGANAAATAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAG 

ACTCCCTNTCACCCTGGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTCAAGCTTC 

GACCCACGCGTCCGTCTTANCCTACAAGGNGGCTNTTATGTACATAGNTGGGTAATACAT 

CCAATTMTGATGT^ITGACATGCTATTTTTGTAGGGAGAAAATATGTGCTAATGATATTT 

TGAGTTAAMNATCTTTTGGGGAGGATTTGCTGAAMGTTGCACTTTTGTTACAATGCTT 

ATGCTTGGTACAAGCTTATGCTGTTTTAMTTATTTTAAAAAMTAAAATACTGTTTGTN 

AGAMCCANCTGGTTTAGAAAAGTTTNNTATGTGACGATAAMCTMGAAATTTCCCTTN 

TTATTCCTAGTATTTTTCAGCACTCCATNAATTCTATTACCCTAA 

Sequence 1309 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAAGACCCAAGAGAGCTGATGGTGCAGTTCCAGTCTCACAGAAGTCTGCTGGAGAAT 

TTCCTCTGGATTGGAGAAGGTGGTCTTTTCATTCTACTCAGGTCTTCTATTGATTGGATA 

MGGTGCACATACATCATGGNGGGCMTNTTTTTTACTCANAGGACATCTGTGCCAATGT 

TATTCCCATNCAGMACAGCCTCCMGTTGACACATNAMTTTAACCAATACACTTTTCC 

TTATCTTNACCCAAAANTTAAGGTTAMGMGTTTTNGGATNNCCCTGAATAANNNATTG 

GCCA TTNGGAATGCCCACTTTTTNCCTTTTTTGGGNGAAAAMTTNTAAAATTNACCNNT 

TTTTNTNMMNTAMAMAACTTGTTGGGCTTANGGNCCACCTNGGTTTNCANCCTTAT 

TMTTCCCAGCCCNTTTTTNGGANGGNCCCNANNGNCAGGNNNGGATTCCCTTTGNGCCC 

CCCGG GMNTTTCMNNNNTTTNNGCCCNGGGGNTAANAMNAAGGNGNAMTCCTNTTT 

TTTTTCANAAAAAAAAAAAAA 

Sequence 1310 

GGGGGGGCATTTCACGCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGG 
GCAGGTCGCAC I I II II II I I I I I I N I II I ACCTAGCCCATGTTACCAATCTAAATGAA 
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CTGTCTATGCATAGAAAACTGCTTTATGCCTAAGATAATTACTGGGATTTAAGAAAGTGA 
GAAAAAAGAATAGGTGGGATTGAGAAATTAGGTAAAAACAGAAGAGGCCAACTAAACCCA 
AGTGCTGCCCTTCAAGGGCTCTAGTAACCGGACGCGTGGGTCGAAGCTTGACCT 
Sequence 1311 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTT 

ACAGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAG 

CCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATGCTATAAAAGAGGAGCATTAG 

AAATATTTGAMGATAGAMTGAGAACTAGTCAATATTTATTTTGCTTAGCACTGTATTC 

AGTATTATGGCATCTTAMGTAGTTMGACTCAATATTTCATCAAAAAAGTTTAAAATCT 

AATCAGAGAAT 

Sequence 1312 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGGCCACATTTTCMTTTAGCCATTTTTCTCTTATTCACCTTTTTCTGC 

TGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCTGAGAATT 

TTATTAGCTGCTMCGCTTGTCTCATCATMGCACTCATATGTTCATTGAGTAAATATTT 

ATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGGTGATTGA 

AACATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAAAGTATGTTAATATCAAAAAA 

AAAAAAAAAAAAAAAAAAAAGGN 

Sequence 1313 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAAGCCTCACTCACAATTATT 

TTTGAGATAGCTCCCAATGAGTTTAATCACTGCTATGCCAGGTGTGTGAGGCTGCTGTGG 

GACAACAATCTTGATTCCTAGAAGAGTCATAAATTTCTAGGGACTACAGGCTCCTGCCAC 

CATGCCTGGCTMTTTTTGTAGAGATGGGGTTTCACCATGTTGCCCAAGCTGGTCTTGAA 

CTTCTGAGCTCAAGCGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTG 

GGCCATCACGCTTGCCCTAGAGTAATATTCTCTATTATCAAGGTAGAAAGTTCAACATAT 

ATTC ATTAGATCTACTTTATAGATACTGTTACTCAGATCACTTATATCGTTATATGTATT 

TTTTGTCTTCTTMCTTCMGTCTTGGATGAGAGMGAGGTGTTTTAAATTTCTCTGTTA 

TTTCTAGGGTTCTATTCATTTT 

Sequence 1314 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGATGACAACATC 

AAAACATACTCTGATCACCCCGAGAAAGTAAACAAAGATGATGAGGAATTCATAGAAAGC 

AATAAAATGCATGCTATTAATGGAAGAATGTTTGGAAACCTACAAGGCCTCACAATGCAC 

GTGGGAGATGAAGTCAACTGGTATCTGATGGGAATGGGCAATGAAATAGACTTACACACT 

GTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCCGGGCAGGTGCTGTGAGTGCTCTGG 

CGMGTTTGGAGCCCAGAATGAAGAGATGTTACCCAGTATCTTGGTGTTGCTGAAGAGGT 

GTGTGATGGATGATGACAATGAAGTAAGGGACCGAGCCACCTTCTACCTAAATGTCCTGG 

AGCAGAAGCAGAAGGCCCTTAATGCAGGCTATATCCTAAATGGTCTGACTGTGTCCATCC 

CTGGTCTGGAGAGGGCTCTGCAGCAAGTACCT 

Sequence 1315 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAGAT 

CAGACGTGGCGACCCGCTGAATTTAAGCATATTAGTCAGCGGAGGAAAAGAAACTCTGAA 

TCCGACCAGTGTAGGTGATTACATTAGCCTTTGAAGTCAACACAAAGTTTAAAACACCTG 

CCCGGGCGGCCGGCCGCCCGGGCAGGTGTACAAGCTTCGACCCACGCGTCCGGCTGAAGA 

CATCCCTAGGGCAGGTAGCAGAATACCTAATTCAACCTAGAGAGGCACAGGCTGCACGAG 

AGTCTCTCAGATAAAGCCCCATTGAAMTMATTTACAATCTAAMTTTAAAAACCCGTT 

AAAAAAGCAGCACAGCATGAGGAGTCAGTAGATACACTGAAAGCAAGATTAGATCTTCAA 

GACTTTCAMCTATAAAATTTAGAMATTATAATAAATTATGAAATAGAGGCCCTTTCAT 

GTCAAAAAGTCATGAAAG 

Sequence 1316 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCG 
AAAAGATGAGGCAACAAGTAAGAGAAAACAGCATTGAGCTTAGAGAATTGGAGAAGAAAT 
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TAAAAGCAGCTTACATGAATAAAGAAAGGGCAGCTCAGATTGCTGAAAAGGATGCCATTA 

AATATGAACAAATGAAACGTGATGCTGAAATAGCCAAAACCATGATGGAAGAACACAAGA 

GAATAATAAAGGAAGAGAATGCTGCAGAAGACAAACGAAACAAAGCGAAAGCACAGTACC 
TGCCCG 

Sequence 1317 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGCATAG 

GGACTCTTGCCTTAAGGAGTGTAAACTTGATCTGCATTTGCTGATTTGTTTTTAAAAAAA 

GMGAAATGCATGTTTCAAATAAMTTCTCTATTGTAAATAAAATTTTTTCTTTGGATCT 

TGGC AAAAA AAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACACTTGTTGAT 

AAGAGTTTTCTGAAAACAGTCTATCAAATATAAAGAATGGTTTCTATCCAAGAATCAGCA 

GTGAGGGAAGAAATACTAAACACCTGTCAAGAAATCAGTTATTCATTTTAAAAAATAACA 

GAACCAGTGCTGCTCTCTGTCATAAAAAAGAACATGTAAAATTTATTTTTATAGGCTTTG 

GTAACATTATATTCCCCACAGAGGCCTTCAATCCTACTTAAAGATA 

Sequence 1318 

AGGTCAAGCTTCGAC CCACG CGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGGTT 

TAGTTGGCCTCTTCTGTTTTTACCTAATTTCTCMTCCCACCTATTCTTTTTTCTCACTT 

TCTTAAATCCCAGTAATTATCTTAGGCATAAAGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCGG 

GCGGCCGGCCGCTCGTGATCTAGATCGCCGACCT 

Sequence 1319 

GCGAATTGGAGCTCC CCGC GGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTTT 

GGGTGGMTTATAATATTTTAGATAAGATTTAAGAGGATTCTAATTCTAGCTACTTGATA 

GGAATGCGAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCTAGCACCCTGAT 

ATTCTGTMTTG AAAC AGAGTTTCAAGTCCTTTGGTCAAGTATTACCCTTATTCCTTCAG 

GMTAGTAGATATTTTAAGATTACAGATAGGTTATCTTATCTAATTTACCTACCTATTGT 

TGAAATTATTTAATTTGCATTTAACTGTGTTTTCACACCTGCCCGGGCGGCCCTCTTACC 

TGCTTCTGACCTTATGCTCAAGAACTCCCCTAACTCTGGCCAGAGCTCAGCTTTGGCAAC 

TCTGACCGTTGAGCAGCTCTCATCCCGGGTTTCCTTTACGT 

Sequence 1320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I 1 1 1 1 1 1 1 1 I 1 1 M 1 1 1 1 

AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCACCAAGAGCCACTATAAAA 

AGATCCTTMTGAGCAAA ATATA TCCCCTATTATTTTCCTACMGTTGCTTTTTACTTGA 

GTAGGAACCCTTGATTGATTTTTGCGGACNCGTGGGTCGAANCTTGACCT 

Sequence 1321 

GG GCGAATT GGAGCTCCCCGCGGNGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGT 

TTGi 1 1 i 1 1 iCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGCCATTATNATTGC 

TTACTCTCATCGCTTAGCGCCC CAGG TGGGATGTGTTTCCAAMCACATTTTTGTATTTA 

TMGGAMTGTAGTTAGGATTMTTTTATTGTCCfMTTAGAACTCACATTTTGGTTAAA 

TCCTC AATTTC ATTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

ATTTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTNTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAAATCAAATTTTTTA 

Sequence 1322 

CCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTCCGCTCACTTCATCCTCCCAG 

CAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTGGGGCTTAAGGAA 

ATTAAAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTCAGAAATTTGATA 

AAAATAGCACTTTGCATTTCTTGAATCTTGAGCTAAATGGAAATTAATACTAAACATTCT 

CCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAGAAGAAAAGGAAA 

TGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGTTAAGGCTTTGGCACCTGCC 

CGGGCGGCCGGCCGCCCGGGCAGGTTCAAAGACTACCAAAGTATGTATTTGATTTTCACA 
TGCAAACAACTTAAA 
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Sequence 1323 



CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGCAGAMCCTTGCCATCATTCTTACTGCTGGTTTGCATCTCATTTATGGTGNNTCTGGG 

ATTTCTTCTCAT GAACA GAMGATTCAGTGGCTGTTATATGCCTTGTCAATTTAATGTAT 

TGCCCTATCCTCTTTTTGATCAAAGATAGAGACTAAGACTGGGAATTATGACAGAAAAAG 

TCATATTTTTCTTTAAATGATTTTGAAATGTTAAAATAGGCCAATATGAGTCAAAGTGCA 

AATTTTTTGGTGACCTGCCCGGGGCGGCCGGCCGCCCGGGCAGGTACTMGCATTTCAGT 

TCCAGGAGAATAAAAGAAATTCCTATTTGAMTGMTTCCTCATTTGGAGGAAAAAAAGC 

ATGCATTCTAGCACAACAAGATGAAATTATGGAATCAAAAGTGGCTCCTTCCCATGTGCA 

GTCCCTGTCCCCCGCCGCCAGTCCTCCACACCCAAACTGTTTCTGATTGGCTTTTAGCTT 

TTTQGTGG I I K I I I I I I I 1 1 1 

Sequence 1324 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCCTMTATATTTACTCTCTGGTCCTTTACAGGA 

AAAGTTTGCCAACCTCTGGCTTAGATGATCACCTGAGGCCMGGAGCCTCGCCCTTGAGC 

ACAAGACTATGTAGTCAGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGA 

MGGCGGAGAAGAAATGGATTTGGATAGGTAGTCAACMTGTCTGTTTTATGTTACCACA 

CATTTTCTCGAGAAATTTCMTCAGCTCTCTGAGMCAGATTCATCTTTAAATGAATGTT 

CATAGGTAACAGCAACTCATGCATCMTGTTGCAAAGTGAGCTCATTTTCACATTGCTTC 

AGGTTAGGCAGAAGGTTTGGTAAAGGGATTMCGTAATTGTTTCCTTGNTGTTTACAAAA 

AGAAGTCCCAGTTGGCATGCCACATAAAATCTTCTGNATCTCACTCTTGGTTACATTTC 

Sequence 1325 

CTATAGGGCGAATTGGAGCTCCCGGCGGTGGCGGCCGAGGTCACGCGTCCGCAGAGGCCC 

AGGCTCCCAAACCGACAAAGTGAAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCT 

GTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAGGAGAGTTG 

TTCC TGTATGTGGCTGCTCTCAGATCTTTCCAAGCAAGCCAGTCATTTGAAGAGGTTTTC 

TTTTCATGCTGGAGGGCAGGCTMGATCMTGAGTGGMGAGAGAMGGCTGTTTTAGCT 

CAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGGCCACCACA 

CATGAGAGAGCGCTCTNTCCTTAA 

Sequence 1326 

GCGAATTGGAGCTCNCCGCGGTGGCGGNCGCCCGGGCAGGTACCAAAATAATTACCAACA 

NTACATTATGTACACCATTTACAGGAGGGTMCACAMCCTTGACAGGTAGTAACTTTTC 

ACCCCACATNACTGAACGCTTMCACTCCTGGCTGTTMTTGTCAGTTCAGTGTTTTAAT 

CTGACGCAGGCTTATGCGGAGGAGMTGTTTTCATGTTACTTATACTAACATTAGTTCTT 

CTATAGGGTGATAGCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 1327 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTGTAGCCTATGACTTG 

AGTCTCTTGMCTTCTAGGAAGAGGCAAACTACAAACTACTAGGATTCTGATTTCAGATA 

TAGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAACTTGA 

ATGGCCTTGGTTGTTCTGTAATTGCTGCCAAAATCATCACAAGCTGTACCTGCCCGGGCG 

GCCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGATGGGAATTCAGGTATGA 

AAGAAMCAGGCAAGGAGGCACTGAGGGAGAAAGACACAGACTTTATCGCTCTGTGGCTC 

ATTGTTACTGGMTATTCTAAMCTCTTGTTCACATGCTATTATGACTTATAAAGCAGCA 

ACAGCTGAGGCGCACCAGGACACAGCTTCCATTTCTTTAACGT 

Sequence 1328 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 

AAAATGGGAGACAATTTCACATGGACTTTGGAAAATA I I I J I II CCTTTGCATTCATCTC 

TCAAACTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAA 

CCTTACCAAAAAAAAAAAAAAAAAAAGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTA 

CAAGCTCGACCCACGCGTCCGAMTMTAAAGCTAGMGTMTATTTTTCCTTTTGTCTA 

TTTTCCAAATTGACTCGATATTGATGGCTACTTTTGTAAG I I I I I ATTTAAGTTTAAAGG 

GAATATTTATTGATCACCTNTATGTGCTCAGTACCT 
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Sequence 1329 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGC 

TTGGGGATTTCGAGGAAGGGTTCATMGGGAGATTTTAGCTGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATG 

TTCTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAA 

TATCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGC 

TTGTGTATCAGTGGCAGGGTTTMGGTTCATTTTTTATTCCCATATTAATCCTTTAA 

TTAGACMATTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGC 

CCTCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAACC 

Sequence 1330 

AGGTCMGCTTCGACCCACGCGTCCGTGMCTTTTATCMGGCTTTTGCTCTTTTAGACT 

TGAGTTTATCTTTATAATTAAGGAGAATGGTTTTTAAMTTTAGTTCCTCTGACACCCCA 

AAATTATCAAAATA AATTA TGTTGTAGTGMTCTGTGTTTTGAAAGTCATTGATAGGACT 

TATATGAGTCAAAATTTTATGGATTATAMCTAGGCTTTATCTGGTTGGAAATAATTGCA 

ATACMGAAGCAACTTTATTAMTTAGACCTAAAGTCACMTCTTCTTCTTTGCTGCTTT 

TTAAAMTTACCTATTACCTTTAAAAGATCCCAMTTTAGAAGAGGAATTAAAATAAAAG 

TTAATGCAATAAAACACTTCCACAATATTCTATTACTTCAACCTCTAATCAATGAAA 

Sequence 1331 

AGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATA 
ATMTAMTTTTTGTmAAAAAAAGAGTGTTG 

CATGTGATATTCTTA GGTAT AGA I Mill) I AGTATTTGTCX)TGTATATTGTTATTCGAG 

CTTCTGGGATCTGTGTTTTGGTGTCTATCATTMCTTTGGAATATTCTCAGTCATTACTG 

CT TCAM CATTCATTCTGTTGCTTTTTCTCTTCTGGTA7TATCATTACACATATATCACA 

CCTT TTGT M TTCT CCCACAGTTCATAGATATTCTGTNGTATTTATTTATTTTTTC 

TGCCTTTTAGTTTTAGAGATTTCTATTGACATCACTTAAAGATGATTGATGAGTTGATGA 

GAATTGAGAGAATTGATGAGAATTGTTGATGAGAATTATT 

Sequence 1332 

ACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTCCTGCATTTC 

TAATGAAGAAATAGGCTGGTCCTCAATTTTGCAGAAGTTGTATCATCATAGGTCATACCT 

AACATTCGTTTGTCAAGAGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCA 

CAAACCTTCAGMTGTGGTTCCTTCCCGCAGGC7TTGTCACACTTAAGATCCAAGAACAA 

ATCAGCCTGGCTTTAACATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGA 

AACTTGACCT 

Sequence 1333 

CCGC GGTGG CGGCCGAGGTAGCGGTCGCAC I I I I I I I I I I I I I I I I I I U I AAATAGAG A 

TGAGGTTTTGCTATGTTGCCCAGGCTGGNCTNCTGGACTCAAGCAATCTCCCACTTCAGG 

CTACCAMGTGCTGGGATTTACAGGCATGAGCCACCTCTCCCAGTCTCAGTTATTATTTT 

MTAMTGAGACTGMCGTCCTCTTATMGGCTCACTCCCTTGTTCCTACTACATTTGCT 

CTGTTTMGTATCTCTTTAAATTCTTCAGTTAAGCGGACGCGTGGGTCGAAGACCTGCCC 

GGGCGGCCGGCCGCCCGGGCAGGTATTMCAGGTGCTTGCAGTTTGTTGACTTTTTTGAA 

AAMTCAAGTTGTMACTTTTATTACAMTTAAAAATGAAGTTCTTAAAAATCTCAACTT 

GACCAGATATGAAACMTTTAAAMCCTTTAAGGCGTATTGAGAAAAACCAGGCTTTTTT 
AAA 

Sequence 1334 

ACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCTCACCTGCCCGGGCGGCCGCTC 

GAGGCCGCTCACCTGCCCGGGCGGCCGGCCGCAC I I I I I I I I I ] I I I I I I I I I NNCAAAA 

CAAMCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGTGGAAAGTTTATTGGCM 

TATTAAATTTCACCCTANATAGGATATGAGAATGTTTTGATAAATCACAATTTATAGTAT 

ATTMTGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTA 

AACCTTAAAGAAACCTTTCTCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATG 

AATTTGTAGATTTCTTTTTAGTTTAATrCCTGAGTATACAGGGCMMGCTTATATCCTT 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

220/467 



TATATAMCTTCTGCTTTGGTCTAAMCTGATATATCTTCACGTTGAGGTTTCATCTGAA 
ATGCNCCACCGTTTGCTGACTTGCTTCAATATGAATTTGGATGGCTATAAAATTGACCTC 
GGCCGCTCTAGAACTAGTGGGATCCCC 
Sequence 1335 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGCCGCTCGAGGCCG 

CACI I I I I I I I I I 1 1 I I I I I I I IGGTAMCAGGCGGGGTMGATTTGCCGAGTTCCTTTT 

ACTTTTTTTMCCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGTGAGGGTAATAAT 

GACTTGTTGGTTGATTGTAGATATTGGCGGACGCGTGGGTCGAATCTTGTACCTGCCCGG 

GCGGCCCATAGTTTGTCMCCACTGGTGTAMACCTTAGTTATATATGATCTGCATTTTC 

TTGMCTGATCATTGAAMCTTATAAACCTAACAGAAAAGCCACATAATATTTAGTGTCA 

TTATGCAATMTCACATTGCCTTTGTGTTMTAGTCAMTACTTACCTTTGGAGAATACT 

TACCTTTGGAGGGAATGTATAAAATTTCTCAGGCAGAGTCCTGGATATAGGAAAAAGTAA 

TTTATGAAGTAAACTTCAGTTGCTTAATCAAACTAATGATAGTCTAACAACTGAGCAAGG 

ATCCTCATCTNGAGAAGTGCTTAAAAT 

Sequence 1336 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAGATCCTGCGGAAGGAA 

TATGTTTTTGCTGACTCCAAAGTAAGTGACAGCAAACTTCTAAAATGGGCTGTGAGGTAG 

GGAGGGGACACAAGCGTTTTGAGGCTCGCTGNGTGCCAGGGAGTGTATCATTAGCTCACT 

CAATTCCCAGAACAACCCATTTCACACCTGGGAAAGGTGAACTTAGAGAAGTTGAGGATC 

ATGTTCCAGGTTGGCCTGGATTTGAGCCATCACTGTCTCAGGAGTAGGGAGGCTTCCCAC 

TTTGCCCAGCTGCCTCCCAGCCTCGAGGCCACATCCTTTATGACCCACATCTAACTCAGC 

CCCACACCTGGGGGAAAGGCTTTCAGCTTCTCTGGGCTGGACTTGGGAAATCTTTGGGAC 

ACTCCTGACCTGCCCGGGCGGC 

Sequence 1337 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCAGGCCCAGGCAGAACCCAT 

CCCATTTTATCCTTAAACTCAGAAGGAAATNNGTCTAAATATTAAAGGATTAATATGGGA 

ATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTCTCTTAGAGTTCAAT 

GMCACTTCAGGAGAGTATTTCCMCMTATTTAGATATTGGAATATCTAAATATTGTTG 

ATTTAG ATMC CACCCTAGATTTCTCACCACCCTAGMCATTTAGNGGGGAGACATTCTT 

TTCTCCTTTTTCTGATAACTTGGTCAGMGTGATTGACTGTGGAMTGGTATTTCTCAGC 

TAAAATCTCCCTTATGAACCCTTCCTCGAAATCCCAAGGT 

Sequence 1338 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGCTT 

GGGGATTTCGAGGAAGGGTTCATMGGGAGATTTTAGCTGAGAAATACCATTTGCACAGT 

CAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATGTT 

CTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAMTCMCAATATTTAGATATTCCAATA 

TCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGCTT 

GTGTATCAGTGGCAGGGTTTAAGGTTCATTTTTTATTCCCATATTMTCCTTTMTATTT 

AGACAMTTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGCCC 

TCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAAAACCTGCCCGGGCGGC 

Sequence 1339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

T1TTGATGAAATATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGNGCTA 

AGCAAAATAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTTATAGCATGGGCTCAGGTATCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTAC 

TTACGGGAMTACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAA 

AGTACCT 

Sequence 1340 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTACTCAAGTAGTCT 

TTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAACAAGGAGTTTGATNTGTGTTNGCTG 

ATTGTGAACCATCAATTGAGATAACTCACTACCTTCAGGCCAGCCAGTTACATACTTTTG 
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AAMGCCAAGAGTGAAGCAGGGTTGTTITrCATCCMTTCTTGGTCTTTTTGTTAAAGGC 

AGCAATAAGATAGGGTGGTTTCGGGCAATCACTTAGCTAATTGGCTCTCTATAGTCATAC 

CTGGATAATATTTGTAGTCATACCTGGGATAATATTTAAAGGGAAGAAACTAAACATAGT 

CCTTAAGTAGGAACCAACTACAAT 

Sequence 1341 

CCGCGGTGGCGGCCGAGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTAT 

TTCTTATTCTACAACAAGGGNGCAGCCTANAGGCAAAACACATCCCATTGTC AI ITTTTT 

GTAMTAMGTTGTATTGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGC 

TTTCACTTACAACCTGAGTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATAT 

TTACTATGTAGCTTTTTCTTTTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTG 

GAGTGCGGTGGTGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATCCT 

CCCGCCTCGGTCTCCCMGTAGTTGGGACTACAGGCATGAGCCACCATACCCGGCTAATT 

TTTTTAAAGTTTTTGGTAGAMTGGAGTTTTTTMTC 

CCTGGTCTTAAATGACCCTTTTCCCATCAG 

Sequence 1342 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAGTTATTTATTGATTTAATCAT 

TGTMTCTCCAATAGAGATTACMTAGAGATCTCCAACATGATTTCATGCATTTAGAGGA 

GAMTATTTCCTGGTTAAGTGGAAAATTGTGCGGATGTGGCTTCTGGAAGACCTTCATTC 

TAAAGCAGCGGACGCGTGGGTCGAAACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCTG 

CAATCCAGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 

AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGAGGGGTGCTCTCCTGAGCT 

ACAGAAGGAATGATCTGGTGGTTAAGATAAAACACAAGTCAAACTTATTCGAGTTGTNCA 

CAGTCAGCAATGGNGATCTTTTTGCTNGTCTTGCCCATTCCTGGA 

Sequence 1343 

CTATAGGGCGAATT GGAG CTCCCCGCGGTGGCGGCCGAGGTACCAGCTTTAAAACCCATA 

CCCCTCC AGGGT TCTTTTCTGTTGCTGGTGAAGTGCACTTTTTAAAAGAGTNATTCATAC 

CATCAAGATTTTTGACMGAAAATTTTAGAAAAACTGTGGAAGAAAACTGATTGCTCTTA 

GTTCTAGCCATGTGTMTTGCTGACCACCTGAAATGGTCCAMCTGAGATTTGCTAAAGC 

ATAAAATACACACCATATTTCAMGGTTTTTAAAAGMTGTAMACATTTCATTMTTTC 

GGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGCCGAGGTGGATGGACCCATCCATTC 

AGGCAGGGGGTGTGGGGTGTCCCCTGTGCTTAGAAACCACCTAGCATCATAAGCTGCAAC 

AGCACTTTATTGGGATCTGAGTCTACAGTTCACATAGGGAGGTGAAGCCGTGGGAGAAGC 

AGGGNGTAAAAAAAAAAMAGGGGGGGGGACTTTACCCCCCTAAGGACAGGNTGCTTTCC 

AAACCTAACAAAAAC 

Sequence 1344 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTAATTTGTTGCACACTAT 
GTMCAAAACAACTGAAGATATGTTTAATAAATATTGTACTTATTGGAAGTAAAAAAAAA 
AAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I ( I M I 1 1 I 1 1 1 1 1 1 GCTGGGGTTT 
TTTTCTTTTCI I I I I I I I ICAGCTACAGGAATTTAGCCAATTCANAGGAAATCTTCCCCA 
TMTTATGGMCTrTNTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACT 
GAAATAACAATAACAACTTCAACAAATAAAAAAACAGTTAAGCTAAATAAACAGATGATT 
GCAGAATTTATGTGATTACTGGGTACCTCGGCCGCTNTAGAACTAGTG 
Sequence 1345 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGTTTGAGTTGAAACGGTATGTGACTTCCC 

CAGCTGCACCCTGGGCAGTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAA 

AACTCCACAGATCACTCCTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCC 

AGTGCATTGAGGCTGGAAGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTC 

CCCTGTAGTTGGGCTAATCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTT 

TTATGGATATTTAGTATTTAGATMAATTTTTACAGTATTCTTGAAATGAACCCAATTAA 

ACACATAGTTCTCAGTCTTGACCACACATTAAGAATCATCTGGTAGACTTCTGTAAACTA 

CCAATGCCTGGCCA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

222/467 



Sequence 1346 

CCGCGGTGGCGGCCGCAC I 1 1 ( I I I I II I I I II I I II I ACCTGAAAATGCTTATTCTAGC 

TTCACATTTGATTGTTTGGCTMGAAGAMATTATTTATTAGACTTAATTTTCCTCAC 

GTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCA 

TTCT GATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTANACGCTTTCAGGACCTTCTT 

TTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 1347 

CTCCCCGCGGGGGCGGCCGCCCGGNAGGCNAAGCTTCGACCCACGCGTCCGCTTTAAAGG 

GAATTCTNTGTAGAGTGGGAGGCGAACACGNCTGGNNCTTCCAACTCAGGAATTCTCGTG 

GCTGGGCTGGGTCAGCGATGGCTTTGTCTCTTTATGTCTAAAGTGCCCTATGGCATGCTG 

AAGGTTACCTAACCATTCTTTAAAAGGAGAATGACCCTCCATGGGAATGGCCAGCCTGCC 

AACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTTCCTTGGGGAGAAAGTG 

CATAAACCAGGGGTCCC I I I I I I I I I I 

Sequence 1348 

AGGTCAAGCTTCGACCCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAA 

AGCAACTTGTAGGAAAATMTAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTG 

GCTCTTGGTGCACTGCCTGTGAGTTAGCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAA 

AAAAAAAAGT 

Sequence 1349 

GCGCGTATACGACTCCTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTA 

CMACTATGTATCTGAAACACTTCTATTTGGCAATTTTATMCAAATCAMTTTTAAAAA 

GAACAAAAGAGATTGCAGATTACTTCGCAGATACAGAATAAAGCAATTGATGAAGTGCTT 

AAGCAAAAGAAACMCAAAAAMGAAAACACACTGCTTTTCTTTTTAAAAATAAAATCAC 

ATTGCTATA GATCA AATGGATAATACCCTTATTAAACAACCATTCCAGAATGTCTTATAG 

TAGCAGTGCTTTTATTTGCACTTCACTTMTTTTATAAGACTCATTTTCATGTATATAGC 

TCTTTACCCCCATTGTTMCGMTAMGTCTCTCATAATTTTTAC^ 

AAAGCAAATGAGAMTGATTTATGTATCGTGGAACCTTTCCCATTTTGGAACCAAAGGTT 
TTAATTCT ATATTTTTG NCT AATATTTCTTTAAAAAAT 
Sequence 1350 

CC GCGGTG GCGGCCCGCCCGGGCAGGTACTATCTATAMGGAGGTTTGATGTTTTCATTA 
CTGTTTTTGTAAATATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGT 
GGCTCATACCTGTAATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGA 
TAGTTTTATTCACTTGGCTGTTTCACCAAAAAAAAAAAAAMAAAAAAGTGCGGCCACCT 

Sequence 1351 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGATTACCATGG 

CAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGTAG 

AAACAAAGTGCGGCCGGCCGCCCGGGCAGGTGCCGCAC I I II I 1 1 I I I I M II I I I II AG 

AATAGGATTGMTTTTATTAAAAACAAAAATAAATCTAAAAAGCTTCCTTCAGTTACAAA 

TATGCACMGMTTTCTGCATTACATCATTTGACATAAAATGTTCTGAATGACAGAAGTA 

GMGTAGAACTTACTACCATTTGAAGACAGGAGTTGAGCGCTGAAAACCACACACATTTA 

TAGAAAGAAACCAAAGTTTCACAGGGAAGACCTGTGATCTCTGGCTACAGGAGCTGAAAT 

TAGGAACATGAAAGAAACTTGGAGAGAGAAGACATTCAATACTCTAAAATACTTCAGCAA 

AAATAGTCAAACATNTGTNAACAACTTGGNACAAAACTTTATATGGTGGGGGGTGGCTAT 

GCCGGAAMANTCTTNACTGGNTATTATTCACCTCAAAAAGGGGGNTTTTAATGNTCACG 

AATCCTTCCTTTMMTAMANMAGCNTGGNTTNTTTTTCTGGNGTCAAGAGTAAAANG 

TANTAGGGNNTACTCAGGGATGGTTTGGMTTTTAAAACGGGGCNTTCCACCCTTGGTGG 

CTNGTGGCANTTTANCCCAAAACGGCNAAANAACCGGCCCGNGGTCAACTTGNAAAACCT 

GGCGGNTTNAAATANGGACCCCGGGTGGGGGATTNANTNAGNTTTGANCCGNNNCTTGGG 
GGGGCCGGCCC 

Sequence 1352 
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cgaggtcaagcttcgacccacgcgtccggttactagagcccttgaagggcagcacttggg 
tttagttggcctcttctgttt1tacctm 

tttcttamtcccagtaattatcttagganmgcagttttctatgcatagacagttcatt 

tagattggtaacatgggctaggtaaaaaaaaaaaaaaaaaaaaaaggtgcgacctgcccg 

ggcggccgctcgaggccgcccgggcaggtactatgtcgattcgacagaacattcagaaga 

ttctcggccttgccccttcacgagccgccaccaagcaggcaggtggatttcttggcccac 

cacctncttctgggaagttctcttgaactcaagaactctttattttctatcattctttct 

agacacacacacatnagactggcaactgttttgtagcaanagccatangtagccttacta 

cttgggcccntttctaggtttgm™ 

atggcnccngcaaacttgnaggggcttttggnnccanaatgatttttaataaaaaaaagg 

ggattgaataghn"aaantcmgggaanggtttatgnaaaggaaaaaaaaaagcctccttc 

ntggtttmtttacaaaaggtnttttntngggggaccgnctntaaagnactngggnttnc 

cccgcmggtgggnnggtattnacccntttnnggnttnaaaaaaaaaanttngnnngggt 

naaccctggaacngggggggttnggng 

Sequence 1353 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTCTTCGACCCACGC 

GTCCCGGGGTTTCAAACTGMTMCTCATTGTTGGGTCTGGATCTGTAAAMGGTTTTGT 

CACTGATGGGCAATATGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAAC 

MCTGGATGAMGCTATAAACAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGA 

ACTTCAGGAGATTCCTAAGAAGGCCTTCAGTGATCTTTTCATCAATATCATCAGGCCTTA 

TCATTGTTTCACATTTGCTTCTCTTTACCATAGGGMTATAATAATTATTTACTGGTTAA 

CTTCCTAGGGAGATTGCCTGCGGCTTATTTAAGATCCAAATTTTAAAGTMTAATTTCTG 

TTGAAGCTGCTTGTGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTA 

AATGAGCCCGGGATGTAGGCAGGTTTTGATGTTTTGCTTGCTTTTATCCCTAACATT 

Sequence 1354 

TATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

TTTTGATGAMTATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGTGCTA 

AGCAAAATAMTATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTAAGCATGGGCTCAGGTATCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTACTT 

ACGGGMATACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAAAG 

TACCT 

Sequence 1355 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGGGGTTTCAA 

ACTGMTMCTCATTGTTGGGTCTGGATCTGTAAAAAGGTTTTGTCACTGATGGGCAATA 

TGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAACAACTGGAATGAAAAG 

CTATAAACAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGAACTTCAGGAGATT 

CCTMGMGGCCTTCAGTGATCTTTTCATCMTATCATCAGGCCTTATCATTGTTTCACA 

TTTGCTTCTCTTTACCATAGGGMTATMTMTTATTTACTGGTTAACTTCCTAGGGAGA 

TTGCCTGCGGCTTATTTAAGATCCMATTTTAMGTAATAATTTCTGTTGAAGCTGCTTG 

TGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTAAATGAGCCCGGGA 

TGTAGGCAGGTTTTGATGGTTTGCTTGC 

Sequence 1356 

CGCCCGGGCAGGTACTATCTATAAAGGAGGTTTGATGTTTTCATTACTGTrTTTGTAAAT 

ATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGTGGCTCATACCTGTA 

ATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGATAGTTTTATTCACT 

TGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1357 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGTCCCGAGTGTTTCCA 
CTCTGTCCATAAAATGGGAGCTAATATTCTCCAACCTGTGTGCCTGACATGATGGTTAAA 
GGGATTAMCAMACAATAGTTTGTAATTTATTCTGTCAGAGCAAACTGCTGGTAAATAA 
AAGGGCTAGTGACGAAAAATAAATTTTAAAAMCCTMTAAAACAAGTTTGTAATTTATA 
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ATTGTATACAAATAAAAGATGTTACAAAAAAAAAAAAAAAAAAAAAGGACCTGCCCGGGC 

GGCC GGCCG CCCGGGCAGGTTTTATTTMCATTCAAACTTCATTAAGACATGTGCAATAT 

GGCMTTTTACTGGGGATTAAACCCTACCTAGGATTGCTTGCTGGGGCTTAGCAACAGGG 

TCCAGTTCACACTTAGCACTAATTAAATACTTTATTGAATAAATACAATACCAAACAAAA 

TGCATTCAAA 

Sequence 1358 

CCGCGGTGGCGGCCGAGGTCAAGTTTCGACCCACGCGTCCGCATTATCCTTCTTGCCATC 

TACCCCATGTTAMGCCAGGCTGATTTGTTCTTGGATCTTAAGTGTGACAAAGCCTGCGG 

GMGG MCCACATTCTGAAGGTTTGTGGGCTGTGTGAGATCCAGAGAACCCAAGGGGGTT 

TTTTTGCTCTTGACAAACGAATGTTAGGTATGACCTATGATGATACAACTTCTGCAAAAT 

TGAGGACCAGCCTATTTCTTCATTAGAAATGCAGGAAACCTGCCCG 

Sequence 1359 

CGCG GTGGCGGC CGAGGTCAACGCTTCGACCCACGCGTCCGGGACCTCAGAATATAAAM 
TATGGI I 1 1 1 II I ICAGACTTACTAGI I 1 1 I I I I GATAATTCCTCTACGAATGTTGATTT 
AACTTAGAMTATGTAAATTTAATATTCAAMCCAMTTATTTTTTAAAGAGGAAAAAAA 
TATAAAACCTGCCCGGGCGGCCGGCCGCAC 1 I I I M I I 1 1 1 1 t I ) 1 1 I I J 1 1NAAACTTT 
AATAGNGTNCGGAAGNTGAATAATTTATGAAGGAGAGGGGTCAGGGTTGATTCG 
Sequence 1360 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCAC 

NCGNCCGGAGATGTATACTGCCACTATAGGAACTATAAGAAAAAGTCAAATGGAAATNTN 

ATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGATANGGNGTCAGTGAACTC 

AAAAAT NANTCAA TNGAAACTACTGAAACTAAAACTCAAAGAGAAAAAAAANGATGGGAG 

ATMTTAT TTTTT MGAATTGGTCATCAAAATGTAGCAACAAGTTCGCCTTATCCTATAT 

CATTTGAATTTTCAAAAMTAAGCTCATTATACAATCTTTAAMTATTTTGAATAGAACT 

GTTTCATGTGTTATTNGTGAAAAT 

Sequence 1361 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 

CAAGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 

TCAGGCCAGCCAGTTACATACTTTTGAAAAGCCMGAGTGMGCAGGGTTGTTTTTCATC 

CMTTCTTGGTCTTTTTGTTAAAGGCAGCMTAAGATAGGGTGGTTTCGGGCAATCACTT 

AGCTAATTGGCTCTCTATAGTCATACCTGGATAATATTTGTAGTCATACCTGGATAATAT 

TTAMGGMGAMCTAAACATAGTCCTTMGTAGGAACAACTACAATTTTAAC 

Sequence 1362 

ACTGI I I i 1 1 I I I ATTTGTTG AAGTTGTTGTTGTTATTTCAGTC I I 1 1 I CTTATTGGGTT 

GACCAGACTTGGTAAAATCTGTMGAAAGTTCCATAATTATGGGGAAGATTTCCTCTGAA 

TTGGCTAAATTCCTGTAGCTGAAAAAAAAAAAAAAAACCTGCCCGGGCGGCCGGCCGCCC 

GGGC AGGTTACMGCTTCGACCCACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACA 

TTTTAA ATGAC CTMTACATATT^ 

TTAMATTTTTGATGGCATTAAAATTTTNATGCCATTA^ 

CA 

Sequence 1363 

CCGCGGTGGCGGCCGAGGTACCACGGTTGTCCCCTGAAAGGTGTTGTGTCCCTCACCAGA 

CTGGGAGCACCTCMGGGCAGAACCCATGTCATGTTCCTTTTTGTATTTCCAGACCTGAA 

ACTGCC AGTAAATAAACCC TAAAAGTAGAAAGAAAAAAAAAAAAAAAAAAAGTGCGGCCG 

CCGCACI 1 1 I I I I I I I I I I I I I I I I I NGGAAMCCAAACATGCTTTATTTCATTTTTTTC 

ACAATTTATTTAAACATCTCANATATACAAAATAGGTACCT 

Sequence 1364 

CCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCCAGAGAAGCTGAAAGCC 
TTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGATGTGGCCTCGAGGCTG 
GGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACAGTGATGGCTCAAATCC 
AGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTTTCCCAGGTGTGAAATG 
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GGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCACACAGCGAGCCTCAAA 
ACGCTTGTGTCCCCTCCCTACCTCACAGCCCAT7TTAGAAGTTTGCTGTCACTTACTTTG 
GAGTCAGCAAAAACATATTCCTTCCGCAGGATCTTCCGGACGCGTGGGTCGAAGCTTGAC 
CT 

Sequence 1365 

TACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 
GATTGATTAATACCTGTCACAGATACATTTTGGTTTACAAATCAATGAACAATGGAGGGA 
ACTCTGTCCTTAATCTTGGTACGAGACAATGAACCCCAGGTACTTACCCCAGACAACGAC 
GCCGCTTNACCATGATGATGGACAACAGGCAAC I I 1 1 I 1 1 I I GGAGTTTCAGCTTGCTTC 
CAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTCTAAGACGATAGAAGTTTTCAGCC 
TAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAG I I I I I GTCTTG G AAATTNTCCTT AA 
TTG N NT NAN N CGTTMN ATTMCMACTAGCTGG NTNTTAMTTTTTNTCNTTTACCCAT 
TANAGGTNCCCCCANAMTTNAAAATNAAATTTNTGCMTTMTTTTTGAACCTTGCCCC 
GGGGTGGGCCCTGGCCCCCCCCTNGACAANGN 1 1 I I I I < I I I I I I M II 1 1 1 I 
Sequence 1366 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCG 

ACCCACGCGTCCGGAGCTGCTCAATAGTGAGAATCAGGTGATATMTGCATGTGGAAAAA 

GAATGTGAAAAATCTMCACTTTAGATTGTATACAGTGTTTTTTAAAAAGACACAAAAAA 

ACTGTCAACATGAGAAACATAAGCAAAGTTTTACTCAAGACAAACATCCACGAGTCACAA 

CTTCAGTTATTCCCAGTCTTCAAAATAACAGAAGGGCAAAGCAAAGGTAAACATGCAAA 

Sequence 1367 

GAC TNCTATA GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTT 

AAACTTTTTTGATGAMTATTGAGTCTTMCTACTTTAAGATGCCATAATACTGAATACA 

GTGCTAAGCMAATAAATATTGACTAGTTCTCATTTCTATCTTTCAMTATTTCTAATGC 

TCCTCTTTTATAGCATGGGCTCAGGCATCAGATGGCGTAGGTCAAGATCTTGGCTCTACT 

GTTTACTTACGGGAAATACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGG 

GATTAAAGTACCT 

Sequence 1368 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTAMCAGAGGMGTATTTACATTCTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCTCGGCCGAGGTACCTTCTGTCACAAAGACCCAAGCTTCCTCCAGCTTCCAGGATAG 

CAGTCAGCCAGCTGGAAAAGCCGAAGGGATCAGGGAGCCAAAGGTGACTGGGAAGCTAAA 

GCAACAATCACCTAAATTACAGTCCTCCAAGAAAGTTGCTTTCCTCAGGCAGAATGCCCC 

TCCCAAGGGCACAGACACACAAACACCGGCTGTGTTATNCCCATCCAAGACTCAGGCCAC 

CCTGAAACCTAAGGACCATCATCAGCCCCTTTGGAANGGGCC 

Sequence 1369 

CCGGGCAG GTCG AGCGGCCGCCCGGGCAGGTTTCCTGCATTTCTAATGAAGAAATAGGCT 

GGTCCTCMTTTTGCAGMGTTGTATCATCATAGGTCATACCTMCATTCGTTTGTCAAG 

AGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCACAAACCTTCAGAATGTG 

GTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAAATCAGCCTGGCTTTAAC 

ATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGAAACTTGACCTN 

Sequence 1370 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCGACCCACGCGTCCGACGACTCACTATAGGGA 
TCTAGATCACGAGCGGCCGGCCGCCCGGGCAGGTACAGAGATTTAAATGAAATCTTCGAA 
AGAATAAATTTGCTTTTCAGTCCACTGTATTTTCAAAATTT 
Sequence 1371 

CCGCGGTGGCGGCCGAGGTACTTCAMGTTATTGCACATACACTTGTTTACTTTGTATGT 
TTTGCAGGATTAAACTTTGTATAATCTTTTTACAAAA T TTTTTTn CAGTATGCAAGCTT 
GCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAAAAAAAAAAAAAAAAAAAGTGCGG 
CCGGCCGCCCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCGGAGAnnnnnnnnnnnnn 
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nnnnnnnnnnnTGATCTAGATCACGAGCAGCCGCACTTTN I I I 1 1 I I I I I I I I 1 1 1 1 I GA 
CGAATTCAGGAGTCCTTTATTAGCCGGCAGCCGANAGACAGCTAGCGCTCAAAATTNTCT 
TGGCCCCGAAGAAGGAGCTAGATTTTCTTTTATAC 
Sequence 1372 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATAGCACATTACATGTCAA 

ATCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTC 

CGTGAAATCAATATCCCGCACAAGAGTGCTAAANAAAAAAAAGNNANAGGNCTGCCTNTG 

TnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTNGNTATCAAGCTTAN 

TCGATACCCGTTNACCTTGNGGGGGGGGCCCGNrTANCCCANTTTTTTNTTTCCNTT^ 

GGGGGGGGTAAATTTNGCCCCNNNTNGGGGNNAAAAAAAANGGGNAAAAAAA I M I I I I I 

CCTNGNGGGNAAANTTTTTTATNTNNTTTTTMNANANTTTTNNCCAAAAAM 

NGGNCCNGGGGGNNNNNTAAAAGNTAAAAAANNNNTNGGGGGGNCCNTTTNNGGGGGGGN 

CCNNNANNNNCCAATTNNNNNNGGGGGGNNCGNCNNTNTTNCC C I I I M M I AAAAAANN 

AAMAAANCNCNGCCCCCCCCNTATTTTTNAATTNAAANCTCCCCCCCCNNGGGGGGGGG 
G 

Sequence 1373 

CCGCGGTGGCGGCCGCCCGGGCAGGTTATAGACAATATGCTCCTTAAGGTCCCTTTCAGT 

CCCGTTCTATGGATCTGTGTAGTTTCGCTTCTTTTTTCAATATGCTCAGAATTAGGACAC 

CAATGTTAATGGAAGATAAGGAAACTATACCACCTATCCCTTATAGAAGATTTGTGCACT 

AACTAATATGAGCCCTGGAAGATCAAGCCAGTAGAAGATAGAAGATCTATCCCTGCTTTA 

TACTTTGGATCATTTATTTGTGAAGATC^ 

TTCATGGAAGTTCGGGGAAATTAACTGGATCTACTTCTAGTCTAAATAAGCTCAGTGTTC 

AGAGTTCAGGGAATCGCAGATCTCAGTCATCTTCCCTGTTGGATATGG 

Sequence 1374 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGAAGAAGTGCTATGTGGCGAGAAAAAAAAGTTTTAATGTATTGGAGMGTTTTAAAA 
AACCCAGAAAAATGCC I I I 1 1 I I II 1 1 I II I GAACATAAACTCAAGATTTTATTGTCTTC 
ATAATAAAAGATGACACTTAGAACTGGATCACTTGGCCCTTCTCTTCTTATCTCCTCCCA 
GTTCAAMTGCTTGCATNTTTTAATAGCCAGCATTNTTTTMNATCTGCAGGTNGGGCTT 
AAACNCCNCTTAAAGCCTTTAAACACAAA I I I i I I I I I I GGNGGINTTTTTAAGCCCTTnT 
TCCCGGAAAAAANTCGGNTTTNAATTTrGGNCCNANAAAANNACCCCTTTTG I I I I II I I 
NNNAANAAAMCCNCNTTTTTTNNTN 

TTTAAAAAAAAAAGGGGGNNCCCCCCCGGGCNGNGGGGGAI I I I 1 1 I I NTAAANNTTTTT 

TTNNNCCCCCCCCCCC 

Sequence 1375 

CCGCGGTGGCGGCCGAGGTACTTCAMGTTATTGCACATACACTTGTTTACTTTGTATGT 

TTTGCAGGATTMACTTTGTATAATCTTTTTACAAAAI I I I 1 1 I J I CAGTATGCAAGCTT 

GCAAGATGAAAATAAAACCTGTTTGCCTGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTGCGGCCTCGAGCGGCCGCCCGGGCAGGTACTCTATAAGAGGTGTGGGTGTCTTGTTTG 

GTCAGGATGTTAGAMGTGCTGATMGTNNCATGATCAGTGTATNCCAMAGGTTTTTAG 

GMGTATGGCAMAANTGTTGTATTGGCTTTTATGGGGACATGATNNTAGTCNNCTTCCT 

TTTTMANANGNNTTATNTTGTTCNANTGGNTTAANTGGTTTTTAAAAA 

Sequence 1376 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 

CCACGCGTCCGGCCACATTTTCAATTTAGCCATTTTTCTCTTATTCACCTTTT^ 

ATTAC TCTG TAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCT 

GAGAATTTTATTAGCTGCTAACGCTTGTCTCATCATAAGCACTCATATGTTCATTGAGTA 

MTATTTATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGG 

TGATTGAAACATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAA 

Sequence 1377 

CCGCGGTGGCGGCCGCCCGGGCAGGTMTCACACAACACTTTCTTTTCCAACTGCTGCAA 
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AGTGCATCTACAATATGCTATTACAGATCCACTTTTAAAAGGTTTCCTGTGACATTACAG 

CAAGCCTCTTTTTTCAAACAGAGGAATMTCCCAMTTCTTCCTCAAATAAACTCCATTC 

CAGTAAATGGTAAATACATAAAAAATTACAGTAAGCCAGACACTTAAAAGGACAGCCAAG 

MGTCTTCCAACAGTTTATTAGAAAGAATGTAGACATTTAAAAAAATCCCCACTGTCATG 

AACATAAATTGAGGTTTTCAGCCCGGGTATAAGCTGAATCAAAAAAAGGAAATAAAAAAT 

CCMTAGTGTATTAAACATTTTTCACTCATTTGCCATACTGACAGTGCAAATCAAATCTG 

GACTAA 

Sequence 1378 

CCACTGGATTTGACTCAGAGAGGACCCCCAGAGGGTGTCTCCATCTTCCCTATTTATTTT 

CAGCCCTTGAGGGCTTCATTGTAGATCAAAGCCAAGGCCCCCAGGAAGGNGACATACTCC 

TGGMGTTCACCTCCTGGTCCTTGTTCCGGTCCAAGTCTTCCATCAGCCTTGCAATTTCA 

GCATCCTGCAGCTTCNAGCCAATGGTGAGCTCCTTCTGGATCAGCTCCTTCAGCTCCTTC 

TTGCTCAGGGTGTGCTTGTCACCCTCCCTGCCGGAGATACCTGCCCG 

Sequence 1379 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCAGGGATGGGAGG 

CACMGTTGTGATTGGGCAAAGTTTATTTTCTATGTCAGCCTGTCAGTCCACTGCCCCAT 

TTTGCAAGAC I I I I I I I I AGCCTTGACAAAATGTCTCAGTTAAGTATAAAAGTTTTTCCA 

CTACTTAGTCCAAAAAAAACTATTAAATCTTAATGAAATAGCCACTCTCAAAAAAAAAAA 

AAAAAAAAMGTGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTCTTTTTC 

TTCCCAAACATAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGACCTNAAGAAAGC 

AACGAAAGGAACGCAAGAACAGAATGAAGAAAGTCAGGGGGA 

Sequence 1380 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGCACATTACAGTCAAA 

TCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTCC 

GTGAAATCAATATCCCGCACAAGAGTGCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAMGTGCGGCCTNGAGCGGCCGCCCGGGCAGGTACATNTATTTTGGATTGTATATTGNG 

TTTGTGTATTTACGCTTTGATTCATAGTMCTTNTTATGGMTTGATTTGCATTGAACAC 

AAACTGTAAATAAAAAGAA 

Sequence 1381 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATCTGTCAA 

ATATACTATGAAATGCATAGTCTCCACTTAAAATGCTGAATGACACACACGTTTTGCAAG 

CATTACTGCTTTCCACAAAAACTGCTGAATAGGAGTTCCGTCCCTGCCAAGATCAGTGTT 

TAAGAGATACTTTATGATGCTGATMGTATTATTGGTGGTGGTGGTGTTCAGAAAGTTTG 

TCACTCATGCAGATGTCTGAAATCTTGTTCCGAATCCATGGAACATAGGGTGGAGGCCAG 

CTCCCCCTTTTTTTAGATGATCACATAGTTCTGAGCAGAGATGTGGTCCTCACCCTGCAG 

TTCCTGCAGGAGCTGCTGCTGCTGGGATGGCTGCTGG 

Sequence 1382 

CCTAT AGGGCG AATTG GAGCT CCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGATT TTTTT AMGTATTTTCTAGTCTTTT 

TGCCGTGTTTTGAAAGTMGATGATGAMTGMTTTTTMTTCAAGAAACATTCAGAAAC 

ATAGGMTTAAMCTTAGAGMATGATCTMTTTCCCTGTTCACACAMCTTTACACTTT 

AATCTGATGATTGGATATTTTATTTTAGTGAMCATCATCTTGTTAGCTMCTTTAAAAA 

ATGGATGTAGAATGATTAAAGGTTGGTATGA i I U I I M I AATGTATCAGTTTGAACCTA 

GMTATTGAATTAAAATGCTGTCTCAGTATTTTAAAAGCAAAAAAGGAATGGAGGAAAAT 

TGCATCTTAGACCATTTTTATATGCAGTGTACCTGCCC 

Sequence 1383 

CGAGGGAGTCTATTGGAGCCCTTGGGAACTTCTGGATCAAATTGGACCTGAATTGAGATC 
TATTTCTCAGCTTTCACTTATGTGAGCCMTAMTTCCTTTTTTGTTGAAGGCAAAMM 
AAAAAAAAAAAAAAAAAAAGT 
Sequence 1384 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCAC l I I I I II I I I II 11 I I I 1 I I 
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TTTTMCGTTTTMTTAAATGGATTTATTTAAAAAGACTATAAAATCTGACATCAAGAGA 

GATAAAAAMAMGACCCATMGATTTAMTTGACAAATGTAAAATGATTGGCTACMTG 

TAAMATACATTTNCCAGCCCCCAAACAAMCACAAGTATAGTAATTATAAAATTTTTGG 

ACCTGCCCG 

Sequence 1385 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTATTTCTTATTCTACAAC 

AAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCATTTTTTTGTAAATAAAGGTTGTA 

TTNGGAACATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACAACC 

TGAGTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAGCTT 

TTTCTTTTC 

Sequence 1366 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 
CGAGAAGAGTTTGCAAATGCAACAAMTATTTAATTACCGGTTGTTAAMCTGGTTTAGC 
ACMTtTATATTTTCCCTCTCTTGCCTTTCTTMTTTGCAATAAAAGGTATTGAGCC^ 
TTTTAAATG ACA I I I I IGA 
Sequence 1387 

CTACTATAG GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I It I II I ) I I 1 1 1 1 I 1 1 
I I I ! I I I I IGCCTTCAACAAAAMGGAATTTATTGGCTCACATAAGTGAAAGCTGAGAAA 
TAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAGACTCCCT 
NTCACCCTGGT ACCTGCCCG 
Sequence 1388 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGG 

GGAGGACCTAGGCAACGGCCTGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAAC 

CTACTGAAGATACCGTGGTTTTACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGC 

CATCTGAGACCAGCCATCTGGCAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATC 

CCACTGACACCACGTAGAGCACAAATGAGTTGCCTCCACTGAGCCCAGCCCAAATTGTTA 

TCCTATAAMTCATAAAAACATAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAA 

ATTAAGTGCGACCTGCCCGGNNCGGCCGGCCGCCCGGGCAGGTACCCATTAATTTGCTCA 

GATATAGCAGGCTTAATGGTTCTATATTTTCAAAAGTTTTTAAGAATGGTT 

Sequence 1389 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAATAAGCCCACC 

CCACTAGGAACTATGTTAAAAAAAAATTCMGAAAGMTTTAAGGGAGATTACAGTGTTA 

CTGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGG 

GTTGGCCATNANAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAA 

CCCNTAAGCCAAGGCACCCAGACCAGTTTNCATACATAGAAAGTTACAGCTGCTTTTATA 

CCCCCTTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAG 

GAMGACCAGTAGAGAGAAMAAAGGCCATCTATACCMTTNTMGTTMTTTAGACTAA 

ACAAGGTCTTAATAGCAMGGATMTTGAMTCCCMACTTACAAGGTTTTTTAAC 

Sequence 1390 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGGTTTCATGAATGGAAACCTAAG 

TAAAACTAAGCTCATTAGTGACAGACTTGTTTTCTTCTTGTTATTCCTCCAGCAACTCCC 

TCACCACCACGCCTCCCTGCCTACCATCCCCGGAAGGGTGCTTATTCTTTAACAAAGAGA 

ATCTAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTGAGAAAACAGACCATATT 

TACTCACATMTTCGCCTTCTACCTTTCACCTGCTTATGTAATAATTTAACACTGTAGAG 

GGGACATGGAGGTGACCGGAGTATTTAGTGGGTTCCTTGCTCCTGGGCTGGGCAGG7TCA 

CAGGCCCCACAGGCCTTGGGCCCCAGCATC 

Sequence 1391 

CCGCGGTGGCGGCCGAGGTACTCTGAGGTCACGTCAAACTATGCTTTAGAACCTTCATCT 
TTTGCTTTCTTGGGCTTTACTTTCCAAAATGGACTACAGGATAATGAGGC I MM I AAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 1392 
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CCGCGGTG GCGGCCGCACCTGCCCGGGCGGCCGCTCGAGGCCGCAC1 1 I I II I 11 I I t I 1 
I 1 1 I I I I I I ITT AAAMTTCAAMMTTAGTTTATTAGCTTMTATAATTAGGTCAATGG 
MTCCTGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGNGACAAATTCAGCT 
GTTTTGTGGCATAAATAAGTGTCTAAGCTGGGCAGTTAGTCTACCC 
Sequence 1393 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCI I M II I 1 1 I I I I I I I I CT 

TTTGAAACAATTTTTCTGAAATTTATTTCTAAMGTCAGAGACAAMCTTTAGGAGT 

ACATTTATACTAAGCATACATGCGTGAGCAAAAAAAAATAAGCACAGAATACAAAAATGA 

MTAGTAMATTTTAATACAGTATTCTGAATACAAGTAGAATACCACTAGATAAGAATTG 

TATTTACCTAAGAAATCTATGATAGNGNGGGNGGAGATAAACCAGTTTAGGATAGCCACT 

TCACTATTCACATTTTAATCAGTGCTGACCAGAAGCTAAAGCAA 

Sequence 1394 

ACTTAGGGCGATTGGAGCTCCCCGCGGNGGCGGCCGNGGTACAAATAAGCCCACCCCACT 

AGGMCTATGTTAAMAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGNACAGGTCCCTTGTTTCAAAGGTATGACACANGGTAACCCGT 

ANGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGNTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNGAGAGAAAAAAAAAGGCCATCTATACCAATTCTMGTTAATTTAGACTAAACA 

A 

Sequence 1395 

CCGGGCAGGTACAAAAGGGTTCCTCTATATGCCMCTAATTCCAAATTTTTACTTTTACT 

GCAAAAAMCCTTTTTGGCATCAAAACTCCATTGTTTCTCTGCACTCTGACACCATCATT 

TCAAAGGGGCTCACATAAATGATCACTACTGCTCTCTCCCTMTTTTTGAAAAAGGAGTT 

TTGAGMTAAAACAGTGCTTTTATTATTAGCCAACACAAAGTGTGAGAAAATCATTCCTG 

AGAATTAACATTTTMGCTAACAGAAATTCAGTATACTTAAAACATAATTATATTTAATG 

AGTCATTATTTGGATCTAAAACGGACGCGTGGGTCGAAGACCTCGGCCGCTCTAGAA 

Sequence 1396 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCGCCCTGGGCA 

GTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAA ACTCAA AAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAMTTTTTACAGTATTCTTGAAATGAACCCAATTAAACACATAGT 

Sequence 1397 

AGGTACTTTAATCCCGTCTTACAGMGAGAAAACTGAGATTTAGCAACATMAAGTATTT 

CCCGTAAGTAAACAGTAGAGCCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATG 

CTAT AAAAGAGGAGCATTAGAAATATTTGAAAGATAGAAATGAGAACTAGTCAATATTTA 

TTTTGCTTAGCACTGTATTCAGTATTATGGCATCTTAMGTAGTTMGACTCMTATTTC 

ATCAAAAAAGTTTAAAATCTAATCAGAGAAT 

Sequence 1398 

AGGTTTGAGTCGACCCACGCGTCCGGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGG 

TGCATGGCCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGA 

CTCTGGCATGCTAACTAGTTACGCGGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGC 

AAGATTCCTGATCGGTATACAGTGATGTATTTACTAAACAGAGACCTGTGCAGAAATTAC 

ATAC TATC CATCTAGATAGGTTGTTACACTTTTGCCTATTGATGGMTAGTTCCATTTAT 

CAAGTTTTATACATCAAAAAGCTTTTGAAGTTCACCAGACTGTCCAT 

Sequence 1399 

CCGGGCAGGTACTGTAAATCTACTGTAATCCTGTTTTGCAGAATACTGCACGACGGAGAT * 

TGAGAAGTGAGAGCTCTTATGACATAGATAACATTGTGATTCCCATGTCATTAGTAGCCC 

CAGCTAAATTTAGAGAAACTCCAATATAAGGGAAATACTTACTCCCAGGGTATGGTATAC 
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TTTACCATCTTCATAN I 11 I I 1 CTTTCCCTTCCCTTCCCTTAAAAAAACTNAAN I I I 1 ( C 
NMGGTGGMGMNTTTTTAATTNMNTGGAMGGGANGCTTCCCTTCTTCCCCAGTTCC 
CTTCTTAGCCNATGGGAGGGGGAAACCGGG 
Sequence 1400 

CCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAAC I I 1 1 I I GATGAAATATTGAGTCT 

TAACTACTTTAAGATGCCATAATACTGAATACAGTGCTAAGCAAAATAAATATTGACTAG 

TTCTCATTTCTATCTTTCAMTATTTCTAATGCTCCTCTTTTATAGCATGGGCTCAGGTA 

TCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTACTTACGGGAAATACTTTTATGT 

TGCTAAATCTCAGNTTTCTCTTCTGTAAGACGGGATTAAAGTACCT 

Sequence 1401 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCACCCTGGGCA 

GNGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGNGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAMACTCAAAAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAAATNTTTACAGTATTCTTGAAATA 

Sequence 1402 

AGGTACTCCCATTTCCCTGAAACAAGCAGCCAGCMCTATCTCAGAAATGTGTCATTTTT 

ACTGGTTATAATTCTTAAAAAGCTTG7TTTCCTAAGATATGAAATGCCTGCCAGTATACA 

MCTGCTGTAACTACTTCCCTTTTTGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCA 

TAGMTCATGTAGTAMTATAATTCATTTTTTGAMGGTTCAGCTATATCCTCTTCCATT 

TGTTTATTTTAAATGATCTAATTGCAMCATGTCATCACTCCCTTGATGTTTACCTNCTT 

GTTATGCATTTTTAGCAGGCTTTATTGTCACC 

Sequence 1403 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 

ACTGAGACTCAGMTGTTTCAATTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 

TGGAAACTGAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I I I I II I I I I I I I II I 

CMTATTATTTATCAMATAAATTTATTAMAGTATTCAAAGACCACTTCAAAGNGTAGC 

TGCCTTCAAGACAGATTTTTGGCACTCATMCGGACACTGCAGTTTTCAACACCATAGCA 

CTCATTCTATTTCACACATCA I I I I IAACAA 

Sequence 1404 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTTCAACTTCTTTCATTGCTCCTC 

TTTTCAGGAAATCCCCACCCTGGGGATACTTCAAAAGACCT 

Sequence 1405 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAAAG 

GGCCAGACACAGTGGCACACGGCTGTAACC 

Sequence 1406 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCCCTAMTATTTAAC 

TGTTACTTGTAMCTTGTGTAATTTATTATTTATTTTMTCAAAATTCTGAATATTT 

TTAAATGAAAGTTGCAAAAAAAAAAAAAAAAAAAAGTGCGGCCCGGCCGCCCGGGCAGGT 

ACATACCTCCTTGCACAMTGGAGGGGMTTCATTTTCATCACTGGGAGTGTCCTTAGTG 

TATAAAAACCATGCTGGTATATGGCTTCAAGTTGTAAAAATGAAAGTGACTTTAAAAGAA 

AATAGGGGATGGTCCAGGATCTCCACTGATAAGACTG I I I I I I AAGTAACTTTAAGGAC 

Sequence 1407 
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AG GTACTCA ATCTAATCCAAMTTTTCTTTCTTAGCMTCTATTTTCTGTATTTAGAAAA 

ATGTTTTTTATTTCAMGAGCCTCTCAMGAGCATTTCACGTATCTTTTACTGI II I I CT 

CTCCACCTCCAAGGGGTCTGTCTAGATCAGTGCGGACGCGTGGGTCGAAGACCTGCCCGG 

GCGGCCGGCCGCCCGGGCAGGTACTGAGGACAAATCAGTTCTCTGTGACCAGACATGAGA 

AGGTTGCCAATGGGCTGTTGGGCGACCAAGGCCTTCCCGGAGTCTTCGTCCTCTATGAGC 

TCTCGCCCATGATGGTGAAGCTGACGGAGAAGCACAGGTCCTTCACCCACTT 

Sequence 1408 

AGGTACATATCACACATTTCCAAATTTGAGACCACTAATGI I I I I I AATTTCAAATATGT 

ATATAMTATGTATTCTTATTTCCMTTATTTCCTTGGCATGAATTCCTAGAAATTGATC 

TATTTAGTATAAGTGCTTTTTTAGCTATATGTCCACTAGTATGGTATGAGAATGCCCTGT 

TTATGCCAGTATTATCATCATTGAATATATTACTGCTGATGTTGTGGTAATACATTTAAA 

CCMTGTGATGGGGCAAAAAMTTATATTTTTACTTACATCTTTAAAATTACTGGNGATC 

TCTG NTATTG AC AAGCTGGGCATANAAAAAGTAAATTAATAG AATT 

Sequence 1409 

CCGGGCAGGTCGGACGCGTGGGTCGAAGCTTGTACAAAAACCCAAGTATCACCTGAATTA 

CMTTATTCTTAAAATTTGTCCTTAAATAGCTTACTCTTGGMGATTTGTTTCTATGTAG 

ACATTATGGTAAAAGTTACTCTGAMCTCTTTTCTTTAGTTATCTGTTTATTCTGAGCTC 

MCAAGATTGAAGTMGTTTTCGGGAGCTACAGAAATTAAATCAAGAAAAGAATAATAGA 

GGATT ATATTCAATTGMGTGCTGGAGCTCTTCTGATATTATCAATTCTCCTTCATAGAC 

ATTTTATAAAGCTCTTTTATGTGAACTCTTGCTTCATCCAGGCAAG 

Sequence 1410 

AGGT CTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTAAGTATACTGMTTTCTGTTAGCTTAAMTGTTMTTCTCAGGAATGA 

CACTTTGTG1TGGCTMTAATAAMGCACTGTTTTATTCTCAAMCTCCTTTTTCAAAAA 

TTAGGGAGAGAGCAGTAGTGATCATTTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAG 

NGCAGAGAANCAATGGGAGTTTTGATGCCAAAAAGG I I I I I I I I GCAGTNAAAGTAAAAA 

TTTGGMTTAGTTGGCATTATAGAGGAACCCTTTTTTGTACCTGGCCCGGGCGGCC 

Sequence 1411 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCA7TGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAA 

Sequence 1412 

CCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGAAAAGTTTGCCAACCTC 

TGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGCACAAGACTATGTAGTC 

AGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGAAAGGCGGAGAAGAAAT 

GGATTTGGATAGGTAGTCMCAATGTCTGTTTTATGTTACCACACATTTTCTCGAGAAAT 

TTCAATCAGCTCTCTGAGAACAGATTCATCTTTAAATGAATGTTCATAGGTAACAGCAAC 

TCATGGATCAATGTTGCAAAGTGAGCTCATTTTCACATTGCTTCAG 

Sequence 1413 

AGGTCAAGCTTCGNTCCACGCGTCCGGGAAAAACGGGGTTACTAGTAGCCGCCCATAGCC 
TGCAACCTTTGCACTCCACTGTGCAATGCTGGCCCTGCACGCTGGGGGCTGTTNGCCCCT 
GGCCCCCCTTTGGTTCCTGGCCCCTTAMNMCAGGCNGGTTTTATTAAACCCCAANNNN 
CCCGGNTTANAAGGGGAATTNAAAAAAGGGCCCCGGCTTTNGNAAAAAAAAAA 
Sequence 1414 

NCNGNCCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGNGTAGA 

ACANGGAGTTTTGATCTGTGTTCAACATGATTGCGAACCATCAATTGAGATAACTCACTA 

CCTTCAGGCCAGCCAGNTACATACTTTTGAAAAGCCAAGAGTGAAGCANGGTTGATNTTC 

ATCCMTTCTTGNNCTTTTTGTTAAAGGCANNAATAAGANAGGGTGGNTNCGGGCAATCA 

CTTAGCTAA 



# 



m 
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Sequence 1415 

AGGT CTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTAAGTATACTGAATTTCTGTTAGCTTAAAATGTTAATTCTCAGGAATGATTTTCT 

CACT1TGTGTTGGCTAATMTAAMGCACTGTTTTATTCTCAAMCTCCTTTTTCAAAAA 

TTAGGGAGAGAGCAGTAGTGATCATTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAGT 

GCAGAGAAACAATGGAGTTTTGATGCCAAAAAGGTTTTTTTGCAGTAAAAGT^ 

GGMTTAGTTGGCATATAGAGGAACCCTTTTGTACCTGCCCGGGCGG 

Sequence 1416 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 

CAGTGCCTGCCACATACTAAGTGTTGGATMGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

MGTGCNGGCCGGCCGCCCGGGCAGGTCAGATGATTGCAGAATTTATGTGATTACTGGGT 

ACTCTMTGGTAAGGAGAMTTAAGACCAGCTAGTTGTTMTCTTMCTTTTAGTCATTA 

AGGAGAATTTCCAAGACAAAACTGCAATCCAGCTGCTTACCTAGGAATACGGCTTAGGCT 

GAAAACTTCTATCGTCTTAGAAATGGGA 

Sequence 1417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAATCCTGTGAGAAAG 

ACAGGACAGAAACCACTGTGCCTATTTTACAGATACGAAAACTGAGGCACAGGTAAAGGG 

GCTTGTCTGTAGTCCCATAGCTAGCAGATGGCTGGAGCCAAGACTGAGGCTCGTTCTTCA 

ATGCTGAGCCAGGGCTCCTTCCGCTGCACCACAAGAACGCTAGACCACTCGCCACCAGCC 

TTCTCATTCCCTCTTCCTCCATTCTAATCATTTCTAGCTGGCTGGCCTCCACAGAGCATA 

GGAAAACAGCCAGGGCCGGGCACGGTGGCTCATGCCTGTAATCTCAACACTCTGGGAGGC 

CGAGCCGGGTGGATCACCTGAGGTCAGGAATTCGAGACCAGCCTGGCCAACATGTTAAAA 

CCCCATCTCTACTAAAAATATAAAAATTAGCCAGGCATGGTGGCGCACACCTGTAATCCC 

AGCTACTCAAGAGGCTGAGGCAGGAGAATTGCTTAAATCTGGGAGGCGGAAGTT 

Sequence 1418 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGNCCGAGGNNCAAAAGAGAGACAAAAGG 

GTTCTCTTGGAAACAAGAAGAGTGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGT 

TAATGCAAAAGATCAGAACAGGGCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGT 

TTTCACTGATGATCACCTTTCACAGCATTTTCCCCAACCAGCATTTCACTTAGTCTTCTC 

TATACCCAGCACCTCCCCCGGCACCCCCGGCAAGCCCACTATCACTTCCGACTTCCAACG 

TGGCATCCGTGAGATCTGTCCACATTAGGCGAAGCAGGAGAACACTGAGAGCAGCAGGAT 

GGGTTTGGAAAGAGCATGCCTCTGGAAACACAGCTTCCTGGGAATTCACATGAGGCCAGT 

CCT ACAG AGAGCAAGATGCACCCCAGGATTTCTTCATTTTCTAATAGATGTGGGAGTGCT 

CCATTTTCCCCGACAGCGAATTTCCCCTGAGAAACGATACTAGACCCTGGGTTTGCCCAC 

CTTGTMCTCTTCCTTATCTNCTCCTTTTCATCCCTAATTCA 

Sequence 1419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 

CACCACACTCTACAGAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1420 

CCGCGGTGGCGGCCGAGGTACACTGTAAATAGCCTTTACCAAACGTGTTTGACAAGGACC 

ATAATTMCATCACTTAGTGAATTGTGATAAAGAAAAAAAAGCCATGATTTATTCGATGT 

GATTGGCTTGTTTTTATGTGGCGCCAAGAACGAACCTGTTTAGCAGCTGTAACCAATGGT 

ACGCGGGGGAGGCGAACAATGGCGGAGCTGGGCGAAGCCGATGAAGCGGAGTTGCAGCGC 

CTGGTGGCCGCCGAGCAGCAGAAGGCGCAGTTTACTGCACAGGTGCATCACTTCATGGAG 

TTATGTTGGGATAAATGTGTGGAGAAGCCAGGGAATCGCCTAGACTCTCGCACTGAAAAT 
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TGTCTCTCCAGCTGTGTAGACCGCTTCATTGACACCACTCTTGCCATCACCAGTCGGTTT 

GCCCAGATTGTACCTGCCCCGGCCGCTCTAGAACTA 

Sequence 1421 

CCCCGCGGTGGCGGCCGAGGTACTTTGGGAGACCACCCCCAGCTATGGTTCCATACACTT 

ANACTGCGCCCAGCTACAGNTTNATACACTTNGGACAAANTATCTGATAAAATAGAGAAA 

AAMTCTTATTTACTATAGCATTACATAATAAATTTNTGAGAAAAAAATTAACCAGGGAT 

GTAAA AAACCTTTACMTAAAAAATAAATAAAAAAGGMGATCCAMTAAATTTTAAAAT 

ATTTTATGTCTTTGGATTGAMGAATMATATTAATAAAGTGCCATATTATCCAAAGTGA 

TCTATAGATTCAATACACTTCCTATCAAAATTGCAGTATTTTTTTCACAGTAATGGAAAT 

TCMTTCTAAMTTTACATGAAACTAAAATAAACTTTGAATAGCCAAAACAGTCTTGAGG 

AAMGGAACAAGGCAGAAGMTATCATACTTACMTTTCAATCTATATTTGAAGACTTTA 

TAGAANTAAAA 

Sequence 1422 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATCCAGCTAATAGCAAAC 

TTTGTGGATCCCMTCCGAGATTTCCCTTGCTGAAAGACAAGAAAGTATCTCATATAAAA 

GTGCTGTAGCAAGTATTTGTATACTCCAGAAATAAGCTTCTGTAATTCTTAGCTGCCAAT 

GTGTTCAGGCGTGATGACTCGGTTTCTGTTTCTCTGAACATCAATACTAGGGTCTGTATA 

ATTTCAATGCATGCCACCAGCTTCATCAACCCTT 

Sequence 1423 

AGGTACAATCAGAATGCTGCATTCTCCAGCCATAAAGATCGCTCCCTCTTCTTTTCAAAC 

ATCCCTGTCCCTCAAGGTCTAGCTCAAGACGGTCACCTTAAGAAAAGCTCCCTTTGTCGA 

GCAGTGACTCCATACCAGGCCCTGCTTTAAACGCTTTATCTGCATTATCTTACTTGATTC 

TCGCAATAGCCCTGGGTGGTAGGTGCAATTATTATCTCCAGTTTATAAAAGAAGATACTG 

AGGGTCAGAGAAGTTAAGTGACCGGCTCAAGGTGTCACATTCAGTAAGCGTTGAAGGGGC 

CTGTGTTGGTCTGTCCTTGMGATGCCCCCTACCGACTACACTTTCAATGATTTTCTGCC 

TTGAACCTGGCCCCATGACTAAA 

Sequence 1424 

NNCAAACCTCCTATGCTTTCCTTGGCATCGGCTACACATCATAGTATTCATTGCCTCCTT 

GAGGTCATC7TGCAGCTTGGACAGAAACTCATTTACTGACCGGCTCAGCTCATTCTCTGC 

CATTCGTTTCATCTCATACTCCTTTCGCTTTTCAGCATTGCTGACAATGTCCCAAGCTGC 

TCGCAAAACCTTGAAGGCCTCCTCAGCCCGGGGATGATGATTTTTGTCAGGATGAACCAT 

CACTGCCAGCTGTCTATAG 

Sequence 1425 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGAACTGATTTCCCC 

AAATGGCTCTTTGTG CAGCC GCAGCAGCTGTTCAGACTTATGAGCAGACAGGAACTGCCA 

GAGGGCMTGGTGTGTTTTMCTGGCATCTGTTTMGGCCTTTAACACGTGAATCGTCTG 

ATCACCCATTTGCAGGATGTCTTGAGTATACACATTCAGCTGCATGTTTGGATCCCCACC 

AGCTGTGCTCAGAAACCCCAGAGTGACTTCTACGACAGACAGCACTTCACAGGCATCGCT 

GTAGGACTGCAGCTGTCCACTGATGGCACTAATGACCGAGCTGGGGAGGGAGTCCTGGGA 

AATGAAAAGCAGGAGAGGGATGTCTGTGGGCTGGGTTTCTGGCATCTCACCACCTGGTAA 

GAGAGCCGAGCCCCTTCACTGCCCAAGCCACATGCG 

Sequence 1426 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGGCCAAGGCTGCAAT 

CAGTGATTCAGCCGACTGCTCTTTGAGTCCAGATGTTGATCCAGTTCTTGCTTTTCAACG 

AGAAGGATTTGGACGTCAGAGTATGTCAGAAAAACGCACAAAGCAATTTTCAGATGCCAG 

TCMTTGGATTTCGTTAAAACACGAAAATCAAAAAGCATGGATTTAGGTATAGCTGACGA 

GACTAAACTCAATACAGTGGATGACCAGAAAGCAGGTTCTCCCAGCAGAGATGTGGGTCC 

TTCCCTGGGTCTGAAGAAGTCAAGCTCGTTGGAGAGTCTGCAGACCGCAGTTGCCGAGGT 

GACTTTGAATGGGGATATTCCTTTCCATCGTCCA 

Sequence 1427 

AATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGAC 
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AAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTC 

ACCMCTTGACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTG 

GACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGG 

CTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTAT 

CAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTA 

CCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

ATTGAGGATGCGCTCMCCMCTCTTCCGAMCAGCAGCATCAAGAGTTATTTTTCTGAC 

TGTCAAGTTTCAACATTCAGGGTCTGTCCCCAACAGGCACCAC 

Sequence 1428 

AGGTACAAATAGATACCTTCAAGGAGAATGAAAACGGGGAATATACTGAGCACTTACACT 

CGGCCAGCTGCCAGATCAAAGTTTTCAAGCCCAAAGGTGCAGACAGAAAGCAAAAAACGG 

ATAGGGAAAAAATGGAGAAACGAACACCTCATGAAAAGGAGAAATATCAGCCTTCCTATG 

AGACAACCATACTCACAGAGGTAMAAGATTTCTTTTGGTGACAATTTCAGTTCATAATT 

TTTMTCTTAAAAATTCATCACTTCCAMCTGGTCAGMTTTACTTCTCCTAAGCCTTGA 

GGGACACAGTATCACATGGATTCTGTGTCCAGCGGCCTTAACAGGAAGATTGCTTTAGAA 

TTTGGCACGAACCATGCCACTGTCTCTGT 

Sequence 1429 

NCNGNCCAGGTACTCNNNNACANTGNAAACTNNTCANGNGCCCATCATTGCTGGATTTGT 

ATTTAACATTATGTTTCACCCAGACAACAGCTCAGAGAACTGGGCAATGGCTGCTNATGT 

GTTGAGCCGGGGCATACAGGATGAAGAGGGACAATGAGAGGGAATGAATTCTATTCTANA 

CACCCTGAGTTTGAGGAACCTATGGAAATGTCCAGGAGGCAACTAAATGAAACAGCCTGT 

GGTAGACAGAATAATGGCCCCAAAGATGTCTACAGCCTAATCCCAGGAGCCTGTGAAAAT 

GTTCCCTTCGCATGGTAAAGGGATGTGGCAGATATGATTAAGCTAAGGATCTTGAGATGG 

AGAGTTTATCCAGGATTATCCAGGTGTGCCCAGTATAAT 

Sequence 1430 

AGGTACGCGGGACACAGGGTCCTGTGCAACANGGNGGACTAACAGTAACACCGCCACGCC 
GGCAGCAMGCTCATTTTGGTCCCCGCCCCGTTCCTCTTTCTCI I I I I AACTCCTTCCCT 
CTTTGCGGATTCTAGMCGGAACCTTTTTTTAATTCTTCCCAGTAGAAACGTAGGAACAA 
TTTCGTGAACGCAATCNGGAGTGCCCAACATGGC 
Sequence 1431 

AGGTACCCCTGTTTAACAAGGGGTAGGGGCCTTCTGAGACTGTTTCCTCTACAGAGTAAG 

GGTTCGTTCAGCCTTTTCCGTGGCCTGCCAAGAACTCAACTCCATGTTCCCTCACTTCCT 

GTMTTGACCTTGTCCAGGACTTTCTGACCTTGGAGAATTCACCTTTGCTCTTTCTGCTG 

CTTCGTGCATTCTTCCACCAAATGTCTTAACTGACTGGGCTCCTTTCCAAACTCAAGGGC 

TTTGCCAAATGCCACCAGCTCAGGGAGGCCTTTNCTGGCCATGACACTTGAAGTTGCAAC 

ACTCCCCCGCAGTCTCCCGTGCCCCAGATGTAAGTTCCATGAGGGCAAGCCCTGTGCTTT 

TACCACCATATCCCCAGCATCTTGAGCTGTGCCTGGCCCMGAAATATTTGTTGAATGAA 

TGAATTTAAAAGGGGATATTCATGANGGCTTACACATTCTCAATGGGTT 

Sequence 1432 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACl I I I I I I I I I CI I I I I 
CI I I I I I I 1 1 I I I I I I I AAGTTTTTGGGATCGTGTCTCACTCCTGTTGCGCATGCCTGTA 
GTCCCAGCTACTCAGGAGGCTGAGGCAGGATAATTGCTTGAACCCGGGAGGTGGAGGTTG 
CAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTTTGTCTT 
CGGAAAAAAAAAAAAAAAAGATTTGGCGGATGAAAATAACCAGAATGAAAATAGCTNGAA 
AACTCANCAAGCAGGAAGCTCCCCTTCTCACCCTTTTGTTCCCTTGCCGATAGAATCAGT 
CACTATTAGAAAAAATGAAAGACGCTCTGTTTAAAACAATGATGACAGCAGTACCT 
Sequence 1433 

GCGG TGGC GGCCGAGGTACTTCCCTTTTAAGAGATGAGTCACCGCAACTGAAACTTCTCT 
ATTTCTTTTCTTTTCTGATTGTTCTCCAGAATTAGGACTAGTAACAGTCCTGAANNCTTG 
TNTTTCCTTATCTAGAAAACTCAGTATCTTCCCTTCCGTTTGTCTTAAAATATTAGTACA 
CGCTTTCTCAAGCCTAGCCGATTAGAAGGGGCTGCCGGGCTTCCACCACACCTCATCGAG 
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GNAATGGTTTTNTGGNNAAAAAGCCCATGGAAATACTGAGCCCATGCCNCTCACGTTGNA 

AAAGCCCCGTTCCTTGCC 

Sequence 1434 

AGCTCCCCGCGGTGGCGTMCTTATCTCATTTTAGATNAGTTTGCAAAGAGAGTTGGTGG 

CTAAGGCCATAGCTTAGCCTCCTGACCCCTACCTTCCCACGTTCTTTCCAAGAGATTCTC 

CTCAGGAATMCACTTGCAAGGGAGTTCCTGATGAAGTGGATTCTTGTTATTCTAGGAAT 

AGGCCTACATGGTGCACTGGCAATGTGAGATTATACCTCAGCATTTTCAAAGAGCATAAA 

MTC TAGAGC TGGGGGGTTTAAAACATGACAMCCTAATTTTAAGTAGGCAGACAAATAT 

TTAMTTTTTCCCCTACCCTTGTTTCTACATCGGTCCATTCAGACTCTGCACCATCTGGT 

TGGGCAGGTGCTACTGTGGMGATCTTCGTTTTCGACTACCATTTGGTGATTCTTGCTTT 

MAGTCTCMTATCAGTAAACTGAACAGATTNCCACCACCCCTTGTTTTATAAATATCAC 

CCTTMTTAGTTTAAGTTTCAATCTCCCCATCGGAGGCTAGTTCCTGGTGGGTCAGCATG 

TACCTGCCCNGGCNGGCCGTCTANAACTAAGTGGATCCCC 

Sequence 1435 

TACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTNTTTTT 
TT1 > II I II I I II I I I IGGAGCCAAAATTGTGTGCATTCCTACTGGGAAACACAGTGGCC 
MATCCTTTTGAATTGTTTCCTTCTAGAGACTTTAACTCTTCTGACTGCAAATCTTAGTG 
TCCTGTGAGTATTAGTTGATTAATTATACTTGCTGCTTAGTGAAATACAGCCAGCTATAG 
GTATCTTCTGGAGTAGCTCMCACAACTTTTCTCTTGCTAGAGTGACTCTTGCTAACAGA 
ACCCAAAGATGCGCACATATACCCACAGGAGCTGGAGGTCCCTCGCATGCTCCTCTCGTG 
CCAGCCTTTGCCTTACCCTTCACTCTCTCCCTCCAGGAGCCGTCGGTACCTCGG 
Sequence 1436 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATTTGTCTTGCTC 

ACCTCTTACCAAATCCAAGAATGGTTTCTATCCAGTGAATGGCAAATTCATCTGATAGGT 

MGGGAATAATGGGTCAAAATGGTAGCAMCACTTCTTTCAAATTTCTACTAAAAGACTT 

GCTGTTGTTTTTCTTATAAAGGGGCAATTTCATCAACATACATCTTTTAAAGGAATCTCT 

AGAAATTTGAGTGACTTTTTGGCCATAATCCTGTTTGATATATTTTTGGTCAGCTGCTCA 

MACAMCATTCTCCTTGTAAGGTTATCTATCTGAAAGATACTAATTCATTTAAAGCAGC 

TGCAGGTGAACAACCTAAAGATGACATGATTTGGGAGAAGAGGAAGGCAGATTACTGAAC 

TGACAAGTGACCCAAAGCATAATTAGGTTTGTGCACATGGTAGCATGGAGGTTCCACACC 

TACCTTCTACAGCGTATTAMATMGAATATTGTCTTTGAMCATCTTCTAGCACCTTTT 

TAATAAMCAMATTTCCCATCTTCMTTCTATTTTTTTCCCAMTCTACCTTTAAAAAA 

TTGT 

Sequence 1437 

CCGCGGTGGCGGCCGAGGTACAATMACAGGGAATGAGMCTATTTACATGGAAGTTTCT 

TTCTCATGATGCGGTGGAGAAGCCTCGGCCACTTGGTTCTGCCAGATGTTCCTGGGGTTA 

CTGTAAATGGGAAGGACAGGCAGAGCTMACAAGGTAGGAGAATCGCCCCCCTTTTTTGA 

ATGTTTMAGAGTTTGCTGCAGTATGCTGCATTCCATGTGTGCTGCTTACGGGAGCCAGG 

GA AACTG GGATTCCACTMTTCAATTGTMTACTTGCGGGGGACCCTGGAGTTTTACGTA 

ACATTTTGATTTGGGAAAAAAAAAAAAANAAAAANANTGTTCCTGCCCGGGCG 

Sequence 1438 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGGC 

ATGCTCCTGCCTTGGGCAGAGTGTATATGTGTGAGTGCACCTGCCCTTCCCACAGCCTAG 

MCGTTCCTCCTCCAGACAGCACATATGGCCTGCTCTCTCACTTCCTTAAGGTCTTTATT 

CAAAAGTGACTTTCTCAGTGAAGCCCTGTCTGCTCACCCTGCGTAAAATTTCAGCTCTTC 

TTTCTATCTCTCTTCCC AG AT I I I I I I I CTCCTTCATGTTCGTTGGTGTCTAAGGTTTAT 

CATCTATTTCGCTAATGGTCAGTAGMTGTMCCTCCACGTMGCAAGGAGTTTTGTCTG 

TTTTGTTCATGTCTATGTCCTTAGTGCCTGGAGCATTCCCTAGTATGCAGTAGGTGCTCA 

ATAAATGTCAGTTGGATTAATGGCTGAAAGAAAGGTCACCGCTATAAGGATGGAGTCAGA 

GAACAAACACAGTTMTTCCTGGTCCACTGTTTTTGCTTCCACTAMTTGTATTTGGTCT 

ACGGCTTCTCCGCTTGCCCTGGAACCCTGCTCAGAACACTGCTCCCTTCTCCTTCTTCTT 
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CTCCCTCCGGATAAATTCT 
Sequence 1439 

CCGGGCAGGTACCGCGGGGGGCCGTGGTCAGAGCGAGCTTCGGAGAAGCAGTGGTGGGTT 

CCATGTGATGGTGGAGTAGGAGGCAGGTCTCCGCGGTTCATCTGTGTTGCTCTAAATGAC 

ACTGTTTCATTATTTTGATGGCtGGAGAATATTTCCTAAGTGTATGTATATGAAGAAGTT 

TCTTGATCTCTTTATCTGTGGATGAACAGCTACTTTGAAACATATGGTACCTCTGTGGTC 

AGACCATTTGCCMGCTTGTGAGGCCTCCTGTTCAGGTATACGGTATTGAAGGTCGCTAT 

GCCACAGCTCTTTATTCTGCTGCATCAAAACAGAATAAGCTGGAGCAAGTAGAAAAGGAA 

GTTGTTGAGAAGTAGCACAAATCCTGAAGGAACCCAAAGTGGCTGCTTCTGTTTTGAATC 

CC 

Sequence 1440 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGTGCT 

TTCATAGCCGCGGACACAACTTGGGCCCACAGTTAACCGAGGAGGAAGGCAGAGCGTGCT 

GAGCAGAGCACCAAGGAAGAGAGCTCGGCTAGCCGGGAAGGTCCGAATGGATTTATTTGG 

TGAGGCCAAGGAACCCACTGCCTCCACGGTGTTCCTCCAGGAGGCTCCTCCCTTCACTAA 

GGCAGCAAGGAGTGCACTGGTGAGGGGAAGAATTGTTGTCATCCTTTCTCTACGACTCCA 

AGAGAAACTTTATACTGGAGGMGAATATTCTACCACTTTGGGATGCTTCCAAAGAAATG 

GGATACCAAAGGAGTGAACTTCAGGTCATTGGAAGTGGCCAAGCTGGAGGCGACCTATGG 

AGACATGACCTTCAGTTCTTCCTGAGATCGATAAAGCAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 1441 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGATATTGTCAATGCGCAACATGGAGAG 

ACTTTAAAACAAATGCTAGGGATTAGAGTATAGATCAGATAGCTGGCAAATCTATAGGAA 

GGGAA AAGTAATTTTAAACACACAGCATTTGTTTCCTGCTGCTCTATCACAATAGCTAGG 

TTTTTAAATAAGTAGGCTTTATACCMGCCATAAAAATGAATTGCTGGGGCTCTTTGGGA 

CTAGGGAAGGCGGGAMTTTTAGATATTGCTGTTGGCTTAGTGAAAATGCATGCTTACCC 

GGTCACCTGTGGCTCCAGCAGGACCAGGGGCACCTACAGCACCAGGAGCACCCTAGTACC 

T 

Sequence 1442 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTTA7TACATA 

TG ATTTTT ATT A GTTTCTG G A G G C AAATTT AATTTTT ATTTTAAAATC A AMTCT ATTTT 

AAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTTAAATAATAAAAAATTTAA 

Sequence 1443 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGGTGCTCCTGGTGCTGTAGGTGCCCCTGG 

TCCTGCTGGAGCCACAGGTGACCGGGTAAGCATGCATTTTCACTAAGCCAACAGCAATAT 

CTAMATTTCCCGCCTTCCCTAGTCCCAMGAGCCCCAGCAATTCATTTTTATGGCTTGG 

TATAAAGCCTACTTAT TTAA AAACCTAGCTATTGTGATAGAGCAGCAGGAAACAAATGCT 

GTGTGTTTAAAATT ACTTT TCCCTTCCTATAGATTTGCCAGCTATCTGATCTATACTCTA 

ATCCCTAGCATTTGTTTTAAAGTCTCTCCATGTTGCGCATTAACAATATCCTAATGCACT 

GAGGCTTCTCAAAGCCTTCAATTATTACCAAAAAAAAAAAAANNTTTNNNAGGTACCT 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACCAAT 

TCCTTCCTAACCTGGATTCCACTGTCCTTGGTGAAAACTACTTTGATGGAACCTACCAGA 

TGCTITATCTTTTGGTTAAAGGAACTATACCTGTNGAAATTCACACTGCCACAGNGATAT 

TTGTTTCTTTCCAATTATNTGTTGCMCANAAGATGACTTTTATACCTCTCACAATCTGG 

NTAAAAATCTTGCCTTGTTCCTAAAGATACCAAGTGACAAAATCCGTATCAGCAAAATAA 

GAGGGAAGAGTCTGAGGAGGAAGAGATCCATGGGATTCATAATTGAAATAGAGATTGGAG 

ACCCTCCTATTCAGTTCATAAGCAATGGCACCACAGGTCAGATGCAGTTATCTGAACTCC 

AGGAANTTGCTGGTTCTCTTGGACAAGCTGTNATTTTAGGAAA 
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Sequence 1445 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAGTCACTCCTGCC 

TTCACCATGAAGTCCAGCGGCCTCTTCCCCTTCCTGGTGCTGCTTGCCCTGGGAACTCTG 

GCACCTTGGGCTGTGGAAGGCTCTGGAAAGTGTAAGTTGGAGTCACTCTGGTCTAATCTG 

GGCTGCAGGGTCAGAGGTGGGGTCTCCTTGTGGTGTGGGTGTGTCCCCTTCTGTAGGCTC 

TGATCCCTCAGCTTAGTTTCGGGAGACCTCCCTGAGGGTGGAATACATGTCTGGCTGGGC 

TCCMGGTTTGTGTGGCAGTTTGAGCTTCTGGAAATGCTTCCTCTATGCAGCCATGCTGT 

CAGCCCAGGTCCCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATACTNTGCCT 

TCTTTCTTTCACCTGTCTGCGACTCTTCAAAAAAAAAAAAAAAAAAA 

Sequence 1446 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTAATCCAAGACAA 

AAGMTAGCTCCTTTCAGAGT TCTCA TCATTTCTTCACCCCAGCCTTAATGGGTATATTC 

CTTTTCCCCAGGTGTCCAGAGTTTTCCAGAATTGTCCCAAGGCTCAGTCCACACCAGTTC 

TTCTCCAGTGTGCTCTCCTGAGAGGCCAGGCACACTCAACAATTATCTAGATGAGTTCCC 

ACTTCTTTCCAAGTGAAGGTTCTGCTATCCTAGCATAGTCAGAATAAGTTAAATTATGTC 

TCTTMGAGGCACTGTTCCACTCCTTTTCAAGGNGTGTGGCATATTTGAAATATGTGACT 

TAAAAGT CTAC AGTCTCTTACCAAAAGCCTTGGGCCAAATACATGTCAAAATTCAGAATT 

TTNCAGATTTTAGAAAAGTGACCCATATACCATACATTGCA 

Sequence 1447 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCAT 

TCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCAT 

CAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCA 

CCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACA 

AAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

TTTCTGAC 

Sequence 1448 

TAGGGCGAATTGGA GCTC CCCGCGGTGGCGGCCGAGGTACMATTGTCGTTTTTATTCCT 
CTTATTGGGATATCATTTTAAAAACTTTATTGGGTTTT^ 

ACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTC 

CAGC AAGCA M GTGC TTTGACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAAC 

TGTTCTTTTGCATTTTGCCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATT 

TATAGCAAGATATTGATCAAATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAA 

GTATMTCTTTC TATAG CCMCTGCTMTGCAMTTAMACATATTTCATTTTAACATGA 

TTTCAAAATCAGTTTTTCATACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGC 

AGAACCACA MCAATTCG AATGGGGTAGAAACATTGTMATATTTACTCTTTGCAAACCC 

TGGGNGGTATTTTATTTTGGCTTCATTTCMTCATTGAAGTATATTCTTATTGGAAATGT 

ACCTGCCCGGCCG 

Sequence 1449 

CCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTC 

ACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAAC 

TTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCC 

AGCCTGGTGGAGCAAGTC7TTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATAT 

TCCCAGGACAMGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAG 

GATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1450 

CCGCGGTGGCGGCCGAGGTACAMTTGNCGTTTTTATTCCTCTTATTGGGATATCATTTT 
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AAAAACTTTATTGGGTTTTTATTGTTGNTGN^ITGATCCCTMCCCTACAMGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAACTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

AACTGCTMTGCAAATTAAAACATATTTCATTTTMCATGATTTC 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 
ATGGGGTAGAMCATTGTAMTATTTACTCTTTGCAMCCCTGGNGGTATTTTATTTTGG 
CTTCATTTCAATCATTGNAGTATATTCTTAT 
Sequence 1451 

CCCCGCGGTGGCGGCCGNGGNACAAATTGTCGNTNNTATTCCTCTTATTGGGATATCATN 

TTAAAAACTTTATTGGGTT^4TTATTGTTGNTGTGGGNTCCCTAACCCTACAAAGAGCCTT 

CCTATTCCCCTCGCTGNTGGAGCAMCCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTNCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAACTGGTCTTTNNCAT7TTG 

CCNCTGNGATATGNCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAN 

CAAATATNGAAAGTTGNTATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTAATGCAMTTAAAACATATTTCATTNTMCATGATTTCAAAATCAGATTTT 

CATACTACCCT7TGCTGGAAGAAACTAAAAATAT 

Sequence 1452 

CCCCGCGGTGGCGGCCGAGGTACAAATTGTCG I I I 1 1 ATTCCTCTTATTGGGATATCATT 

TTAAAAACTTTATTGGGTTTTTATTGTTGNTGTNGGGNCCNTAACCCTACAMGAGCCTT 

CCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTTCTTGCTTCANTCATCAGCCAGCAAGAGGGMCAAAACTGTTCTTTTGCATTTTG 

CCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAT 

CAAATATAGAAAGTTGATATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTAATGCAAATTAAAACATATTTCATTTTMCATGATTTCAAAATCAGTTTTT 

CATACTACCCTTTGCTGGAAGAAA 

Sequence 1453 

GAANCCCCCTTTNGACGAANANNCGCGAATCGNGAGCTCCACCGNGGNGGCGGGCCCGAG 

GGGGACNANGANTTTTCTTGNNCN I I I I I I I I NNAAAAACNGGNGACTATTTAATCCATC 

TAAAAATACAAATCAGGNAANGGGGGGAACCATAGGAAAATCCTCCACCTNTAACAAGAG 

CCGAAGNTACNGGGCTTTCTGCTTGCTCCAAANAAATCCCAAAGGGCTTGGATAGTTTGN 

GGAANGGGGAATTATCTGTGTCTTCAAACTAACTCCCAAGGATACCTCAAAGGACATTAA 

AGGTNTACCACCACCATTCTGGGGGMGAAAAAGGGGGG7TTCTTGCCCCTTGCTTGAAA 

AGCCT TANAAAT NGGGGGAAGCCTCCAAATNGCCNTTNGGGGGNGNAAAAAGGGGNNCCC 

TNTNAATTTTTTNAAAAMAMMAATTGGGTTCCAAAAAANAACCCCCCTTTNGAAAGG 

GANCAAANGGGGGGGGGGGCCCCCTNCCCCNAAGGCNAAGGGGGGGGGCCCNNTTNGGGG 

GGGCCNCNTGGGGGGNNGGNNCCCCAAAAACCCCCCTAAACAAAANNTNGGGGGCTGGGC 

CCNTAANGAGGAGGGCCCCNCCAGAGCCCAAAANTTNGGGGTTTNNACCCCANNNAAAAT 

TTTGGGGGGGTN I I I I I I I I I ACCCCCNTTTNNGNNCCCCNCCCCCCTTGGGGGGGAAAA 

AAAACCCTTNAAAACCCCNTTNTTTNAAAACCCCCCCGGGGGGGGG 

Sequence 1454 

CTACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTTN I I I I I I I i 

MINIUM I ATACCAGATTTTAAGGAAAGACTGCTTGCTTCTGANAAAGAATTCTCGG 

AGTTGATTCTCTGCTCCATTTGCTCCTTTCTCAACCTCTTTAGTCCTCGTTCTCTGCAGT 

MGAAGCTAGTGACMCTGGMTTTAGCTCCCAGNGGGCTTTCTTCCGGGTGGNGCCTGG 

ACAGGCTGCTCCTGCTGCTAAGGCTTCTGGAGCTGTTATTGAAGATGTCAGCTTCTGCCA 

CTGNGGTCTGCTTTTTGGAGTCTGCATTGGTTTTGCCTCGATCTCTATCATTCTTCTCAT 

TATTTTCATGAATGAA 

Sequence 1455 

CGAGGTACTGACCTCGTNTGTCCCTTCCCCTNCACCGNTCCCCACAGCTTTGCACCCCTT 
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TCCTCCCCATACANACACNNNCCATTTTATTNTTTGGGCCATTACCCCATACCCCTTATT 

GCTGCCAAAACCACATTGGGGGCTTGGGGGGGGGCCAAGGGGCCTTGGGCATGGGACCAA 

GGAC C ACCTCCCC CTACCCATATCCCTCCCCGTGTTGTGGGGTTTNGGGGAAAAAAACCT 

TTTTGGI I I J I I I I GGGGGGGGTC I I 1 1 I I 1 1 I CCTCGGAAATTAAANAAAAAAAGGATT 

NCTTACCTACAAGAGAAANAAAGAAAAAAAGGGGTACCCTTGGCCCCGGGGGCCGGGGCC 

CGGCTTTCTTAAGGAACCTAAGGTGGGGAATTCCCCCCCCGGGGGCCTTGCCAAGGGAAA 

TTTCCGNATTATTCNAAAGGCCTTTAATCCGGAATNACCCCGGTNCGNACCCTTCGGAGG 

GGGGGGGGGGCCCCCGGGGTAACCCCCAAGC 1 1 I I I IGGTTTCCCC 

Sequence 1456 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATAAGACCC 

TGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAG 

AMTGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACtTCTACCTGA 

ATTTCACCATCACCAACCTACCATATTCCCGGGACAAAGCCCAGCCAGGCACCACCAATT 

ACCAGAGGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCA 

TCMGAGTTATTTTTCTGACTGTCMGTTTCAACATTCAGGTCTGTCCCCAACAGGCACX5 

ACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTG 

CCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGGTACCTGCCCGGGCCGGCCGCTT 

CGGCTTTAGAACTAGTN 

Sequence 1457 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCCAAATTTTAAGGAAAATGA 

GTCCCGCAATGAGTTTCCTCATGCTTCGCCTGTGCGTGGACCGGNCAGCTTCTGGGTGTG 

ACTGGAGCAGGGCTTGTCTCCTTCTTCAGAATCACTTTGCAGGGGTTGGCAAAGCCGCTC 

CCATCCACGTACTCTCTGGACACMTAATTTTGGCCTATTGCCATCAMTGCCCATTTTC 

CACTGCTGGAAGCAATGTCAAAAAAGGGCTGGCCCAAAAAAAGACCCAGAGCTGTCAATA 

CMCACTGGAGACAGATGCAACTGMTMACCCTGTTTTACCCAATTGCACTATTTGGTC 

CT 

Sequence 1458 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTGGAACAGGGATAA 

GTTCTTGGATAAGGNGCCAACATACCTATAAMGCTGATTTTTGAGTAAATTATTGATTC 

TAACATATGTAATGGATTTGGTGTGATMTTTTCTGATCTTTAACTATAAGTGACTTTTT 

ATTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAAT 

GGAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAA 

TAAGAGGAAGGAAAATAAACTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTG 

CAGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAG 

GTCAGGAGTTCGAGACCAGCCTGGCCAACATGGNGAAATCACGTCTCTACTAAAAATACA 

A 

Sequence 1459 

NGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCACTC 

AGGGAGCTCAGATTTTGAGACAGTAGCTGGCCGATGCTCCCAGCTGAATAAAGCCCTTCC 

TTCTACA AAAAA AGAAAAAGAAAAAAGAAAACAGGATATCTGAAATTAAGACTGCAGGAT 

GGAGTAGTTTTCTGAAAATGACAGGGTCCAAGGTGTGACNCACCGGGACCAAGTGGCTGA 

ACTGGAATGAAGTTAAGAAGCCAGTAAGAAAACATCNCGGATAATATGGTGGATCAGTTC 

MCAGNAATGACATTATTTCACCCATGGTCCCCAAAGGGAGGGAGATGACTGGAGNATTT 

CAMTTCTTCMGGCMGCCCTCMTGCCAGCCCAGAGGATTTTAANGAGGGGCCTATTG 

TTGTTCCCAGAMGGAGGACTCTGNGGCCAAMCCGCCAAGMTGGGAmCMGAATTT 

ACTTCA MATT CCTGTCAGGATTTCTTTAACCCTGGGTGGGGCTTATACCCAAAACCCCA 

MAAAACTTTTMGCCAGCCTTCNTACTTTTTGGCTTAATTTTTCTCCCTMAGCCCCM 

CCCTTGGGCN \ WWW 

Sequence 1460 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGGCAGGTACTCAGAAATCCTTCCCCGAATT 
TACAGCACAGGCAGGATGACCTAAGAGGCAGTTTACTTCCCTGAGACCCACAGTTGGGCT 
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GTTCTGGAAACACATCTGTGAATCATAGCCAATTGCCACAGAGAAAACAGAACCAAGCCT 

CCGGTGAGGCCACTCCACCCCAGAGAAGTCTGCAGAATTCCAAGGACTCGGATTGGATGT 

TCAGMTTCAGCAACTGGAAAGTCCTTAAAAACAAACAGGCCAAACCAAATCAATATTGC 

TGTTTCTAGATGTCCCTTCTGTGGTTGAGCTAGTTTTACAGAGATAAATATATTAAGACA 

AGGAGGT GGGGGTGTTATATGATCMTGATAGCCATTTTGAAAGAGAGGGAGGAGTACTT 

I I I 1 1 1 1 I I I 1 1 I I I I 1 1 ICCAAGCACGTGCCACTTTATTGAATGACACTGTAGACAGGT 

GTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGG 

Sequence 1461 

AGGTACGCGGGGCTCAAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTC 

CCCTGCACCGAGGGMCAACTCGAGCCTTTGAGGGCTTCCTCCAANCCTACCCACTTTGG 

CCCAGTCANACCAAAAAAGGGGAAAGNAGGCCTGGCTTCCCCTTGNCAACAAGGGNATTA 

ATTCCMAMGNCCTTCCTGGTTTTTTGGNTGGTNCCTTGAAGGGNAAGGGNAGGAAAAA 

AAMGNCCTTGNAACCTTCGGGGMGGGGNAGGGTTCCTTAACAATTCAAAAGGAAATTG 

GGGGGGGGAATAAAAGGAAAAANGGGGCCAAGNCCTTGGTTTGGAAGGAAGGNAGGTATG 

GCTTCAAATTAATGGCCCCCCAANGGCTTAATTGAACCAAAAAAGGGTNCAAAAGGGGAA 

CAATTCTTCAANAAGGGGTGGGGTTCCACCCCCCCNTTCGGGGTTTTCCCCI I I I IGGTT 

ACCCCTTGGCCCCCGGNNNCCGGGCCCNGCCTTCCTAAGAAANCNTAGGGTTGGGGNANT 

NCCCCCCCCNGGG 

Sequence 1462 

GCGGGCAGGTACATGGATGGGAGCAGCTTCACCAACCCCTGCAAAGTGACTCTGAAGAAG 

ACGACAAGCCCTGCTCCAGTCACACCCGGNAAGNCTGACTGNNTNCCACCGCNACAGNCT 

GA MGGCATT GAGGGNAAMCCTCAATTCNTCGGGGNACCTAMTTTTTTCCCCTTTAAA 

AANATTTTTTAAGTAACCTTTGGCCAACC'AAGGTAAAGGGGGMCTTTTCCAAAACNTNG 

AACCCCTTTCCCTCCAAGNATTTGNAAGGAAACNTGTTNTTTCCCAGGTTAATTAATTAC 

CNATTTCCAAAAGGTTCCAACCTTTNGGAANGGGGTTAAGGGGGAAACCAANAAAAAATT 

TTGGCTTAACCAAGGTTCCCTNAATTAAAAA I I I I I AAAANGGGGGG I I f I I 1 1 I I I I 1 A 

TATAAAAAAGGGNGNTGGTNTNANNCCCCTTTTNGGNGGTCCCCCC 1 mTTTTI I 1 NAA 

GAAAAANCCNTAAGGGGNGGGGGAAATTCCCCCCCCCCCCGGGGGNNCCNTTGGGCCAAA 

NGGGAAAAATTTTTCCNGNATTTAATTTCCAAA 

Sequence 1463 

AGGTACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAGGCGG 

AGCTTGAGGAAACCCGCAGATMGTTTTTTTCTCTTTTGNAMGANTCCCNCNTTTAAAA 

TAACCAAMCCTTACCTTTAAAAAAAAAATTMTTANGTTCCMMTTAANGGGNTTTTA 

ACCTTNAAMGGNMTTAATTTTTNGGCCTTTTTAMNNCCGGGTTTTTAAAAAGNGT^ 

I I I I i I ! I I AAAAACCCGGTTTAAAAA I I 1 1 I 1 1 M I AAAAATTAAAGGCCC I 1 1 I I AAA 

AGNNAA'I I I I I I I I I I I AAAGGNGANGGAAAAAAAAA I I I I I I I I I I GGNANAAGTGAAA 

ANCTTTTTTTNTTAMAAAATNTAMNNTTN 

NAAATGGGNGGGGTNNNANANAAGGAATTTAMAMNTGGGNTNTTTTTNTTTAAAAAAA 
AA ACCAA ATTTTATCCCTTGNTNAGGGGTTTTTTGGGNAAGNNAATATGGAAAAAANAAN 
CCI I I I INCTT 
Sequence 1464 

TTTGAGMGCCAGCGCTCACCCACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACT 

TGGAGAAAAGCACAAACACGACCAGTCCCATCCTGGGCTCCCGNNGGCCGNCCTTTCTTT 

CCTMTNCTTACCGCCATTTTGGTATTCCGGACCTGGCCAATTTTAGGTTTNGGGACCTT 

MAAGNAATTGGMTTGGMCCTTCCAAGTTTTTTAAAAAAGGGGGAAGGGGAAGNAACC 

NMANATTNNCCTTNGAACCCTTGGTTCCTTAAAAGNCCAAANGGAAATTGGGGCCCCCC 

AANAGNCCCTTGNNNGCCAAAAAGGNAAAAAAAAAAAAGNNCCNAACCCAACCTTTGGNN 

CN AATT TAACCCNAATTTAAAGGGGGGAAATTACCAAGGAAAAGGGGGGNCCAAGGAAGG 

CCTTTT CTTGGGCC CCTTGNGCCGGGGGGMTTTCCTTNGGCCAAAAACCCMTTTTTTA 

ACCCAN1 I | | | | | IGGGl I I I I I I I I GGGCCCCCTTTGGCCCAANAAAACCCTTAACCCC 

CCCCGGGCGGGTTNACCCCCTTTCGGGGGGGCCCCNGGCCTTTCCTTAAGGNAAAACCTT 
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AAAGGNNTTGGGGGGNAATTTCCCCCCCCCCCCGG 
Sequence 1465 

AGGTACTTTATTTTTC I I I I I I I I I I I I I 1 1 GTGGATGGGGACTTGTGAATTTTTCTAAA 

GGTGCTATTTAACATGGGAGGAGAGCCGTGTGCCGGNATCCAGCCCAAGCCCNCCCGNTT 

CACTTTTTCNCACACNCTTTNI I I I I CAACCCTTGGCACCTTCTGGGGCNTTTACTTTAA 

MGNGGNCCCCTTTNTTTGGNCATTACATTAGCAGCGGAAMCANCCTTTTANTTNCTTT 

CCCCNTTNATTGGNGMNANAACCTCCCATTTTNTTTTNTTCAAANAAGGAGCATTNGNC 

AAANANCTNCCCAAATNTNNCCTTNNACCACNGNGGGNCNNTANNCCACNNTNCTANCTT 

NAGG NGNNGCCATTTGGGTTNCNCCCTTTGNNCCNGGMNTTCTNCCCTTTTCCNTTNGN 

TrTTNTCCCCNCNGGGNNGTTTTTrTCCAA 

CNCNAANAAAAANGTTNAANCCNAAAAAAAAGNGCCCAAGAG I I M I I I M I I I I I CCCC 

CCCCCC 

Sequence 1466 

AGGTACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAAAAAAC 

AAAAACAGCAAGTTTACATGATCCCTGTAACAGGCCATGGNNCTNCAANNCTTCAGGAAT 

GCCTTNCCNTCNCATTCTGGCCCAAAGNTGGTTGTTTNCNTGGGAATNACCAGGNAGGCC 

ACCMTTCGGTGGGGCCTNTCCTGGGGGGNGGTTCMCCMCCTNCAAGNCTTTTAAGGG 

CTTGGTTGGGGGTTCCCCACCANGAGGCCACCTTNCAATTCTTGGGCTTGGGGGACCTAA 

TTGNGGNTGGGGTTGGGNTTGGGACCTTCCTTAACCTTCCAAAAGTAAGNCAANAAGCNT 

GTTTMCCCMGCCMCAATTTNCAAAAAACCAAGGTTGGTAATTTGGNAAACCANTCCT 

TTTTTTNAAAAATNATTCCAAAAAAGTTNGAAGGAAAAANCCANAGGAAANGGGNCAAAC 

CATTAAANTTAAAATNGGTTTAATCCAAGNAAAAAAG 

Sequence 1467 

CCGGGCAGGTAC I I I I I I I I I I I I I I 1 1 I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGACAAGCCATGCCNCCNNCCCNGGAAANGCC 

GCNCAGAGGGTTCCCTTGGGCAATNTGGTATNTACTGGGGTATCCTTCTTNTGCGGTTNC 

TTTCGGGNCMTTTTTCCGATNNCCACCTTTNNCCTTACAAAGGGNCCCCAANNNTTGNC 

CTTTCTCCANCNCCCCAAAAGGTTNNGTGCCCCTTTTNNNCCCCGNAANNNGGAATTGTT 

NAAGNCCTTNGNMGNGGTTATTGGGGAGGACCATTTTCCTTTCCTTTNCCCCCCCCCCA 

AGGMAAAAAAAATTTTCCNGTGTTNAAGGNGGTAATGNNAAGTAACCTCCX)CCCAACCA 

MTTGGAAAGGGGNTTAAAGMNTGGNTTTTTTNACCTTTm 

CAMNNNCCTTTTTTCCAMGGGGGGGGCCCTTTTTCCTTTCCT^ 

TTNGGAMTTNGGGGGGGGGGGCCCTTGGGAATTTGGAAAAAANCCGGNTTNGGCCCGGN 

CCTTTGGGTCTTTGGTTTTCCCCAAAAAACCCCCTTNGGGCCAAACCCCCCT 

Sequence 1468 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTTCCTCCACCTCCNNCGCCACAATCTTCC 

CA AAGAAA GGGGAGMAGTCTTTTCTTCCTTCGGCAGNCNCGNGTAGTCNAGNGGTTCCC 

CCTTTTTTCCTTAGGNAGGCATTTAAGGGGNAAGTAMGGAAGGNAGGAAGNAATTCCGG 

CCTTGGTTCNTTCGGGGGNAGTAAGNAAAAATTCCACCANAGTCCCGGTCCCCGGAATTC 

CCTTTTCNTTCTTAAGGGGGTTCCTTCCGGTTMGGTTCCGNMTTCCTTAAGGGGTTNC 

AAMTGGAAMAGNGAAAAATTAAGGAMGNMCCMGGTTTTTTGGCCAAAGGAAGAAA 

AGGTTGGGGTGGTTMCCCCTTGGCCCCCCGGGGGGCNGGGCCCCGGCTTTCCTTAAGAA 

AACC7TAGGGTNGGGAAATCCCCCCCCCCGGGGGGCCTTGGC 

Sequence 1469 

CCGGGCAGGTAC I I I I II I I I I I I I 1 1 1 I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGNACAACTCATGCCTCCTNCNNNGAAAGGNC 

GGCCCAAGGGTCCCTGGGCCAATTTGGTmCNTGGGATTTCTTCTTTTCGGNTACATCG 

GGNCMTTTTCCGNTACACCTNCNCCTNCNAAGGGNCCCNAGGTNTGGCTTTCNCNNCGC 

GCMNAGNTNGTNCCCTTTTCACCCGGAATGGMTGGTTMGACCTTGNMGNGGATTTT 

GGGNGGACCTTTTNCATTANCTTACCCCCCCCAANGTAAAAAAATTTCCGGTNNTAAGGG . 
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GNMGMAAGGMCCCNCCCAACAATTGMGGNGGTTAAGGTTGGGTATTTTCCTTTTCC 
mCCCCCTTTCCCC 

MCCTTTrTGGNMTTGGGGGGGGGGNCCTTGGGMTTGGGAAAAAACCGGTTTNCCCGG 

CCTTTNTCTTTGNGTTCNCCAAAAACCCCCTTGGGCCAAACC 

Sequence 1470 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTNCCTCCACCTCCNCCNCCNCNATTCTTC 

CCAA AGAAAG NGAAGAAAGNCCTTTCCTTCTTCGGCAAGNCCCNGNAGGCCAGGGGNTGC 

NCCCTTTTTTCNTAAGGAAGTATTNAGGGGGAGTAAANGTAAGTAAGGAAGGAATTCCGG 

CCTTGGTTCNTTCCGNGGGTAGGAAGGAAAAAATTCCAACCAAAGGCCCCGNTCCCCGGA 

ATNCCCTTTTCCTTCTTNAGGGGTTNCNTNCGGGTTAGNTTCCGGAATTCCTTAAGGGGT 

TCAAAATGGAAAMGGGAAAATTAGGGAAAGTMCCAGGTTTTTTGGCAAAAGGAAGGAA 

AGGTTGGGGGTTGGNTAACCCCTGGCCCCCCGNGGGCGGGGCCCGGCTTCCTAAGAAACC 

TTAGGGTGGGGAATCCCCCCCCGNGGGGCCTTGCCAAGGGGAMNTTTTCCGNAATTAAT 

TCCAAAAGG 

Sequence 1471 

AGGTACAAACGAGTCCTGGCCTTGTCTGTGGAGACGGATTACACCTTCCCACTTGCTGAA 

AAGGTCAAGGCCTTCTTGGCTGATCCATCTGCCTTTGTGGGCTGCNTGCCCCCTGGTGGG 

CTNGCCTGNCCCACCCAACCAGGCNTGGCTTCCCTGCCTTGGCCTTGGCCTTGGCAAGGC 

TCCCCCAGGCCTTAAAGGGGTTTTGGNAAAGGNCNCCAAAGGGGAAAAGGAAGGTTCCGG 

GMGG GGAANTTCCNGGGAACCGAAGGGNATTMTGGGGGGMTTTTTTGGGGTCCTTNC 

TTTT TTGGAC CNTAAAAATTCNAACCCCAAAAAAAAAAAAGGNCCNAAANCCCCCCAAAA 

ANCCTTTTTTAAGNCCCCAAGG I I I I I 1 1 I AATTTTTTGGCCAAAAAAAAACCAAAAGGG 

GGGAAAAAAANTTAAAAAAAAGGGGGGCCCTTTTAAACCCTTTTTTCCAANTTANAhTrAT 

MNAATAAAATANNANAANANNNNNTNNGNGGGTTTAACCCCCTTTGGGCCCCCCCGGGG 

GGGNCCCGGGGGCCCCCCGGGNTTCCTTAAAGGAAAAACCCTTAANGGGTNGGGGGGAAA 

TTCCCCCCCCCCCCCCGGGGGGGGCCCTTGGGCCCAAAGGGGGAAAAAA 

Sequence 1472 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACCGCAGGACAGANCTACCGNCGGGGAGNCN 

CGNGGNAGGGAATGGGGGCATGCAAGTCATGNTGAGAAGNGGAGNGTGATCTTACAAGCA 

NNGGAAGNACGTATGNGGTCCCGNGTAAGNAAGTCCANGTAAGNGNCCCCTTGGAAGNAA 

AGTCCCAAGTANCGACCAGTNTTGNGNAGTAAGTAGGGTGNTGGGNAAATAGGTTGNANC 

CCATNTCCGGGGGTCCTTGGGGGGNCCTTGGGTAAGNCCNCGNCCAACCAACTGCTTCTT 

CCTNCCCCAATGGTTTAAAAMTTAGGCCACCCCTTTTTTMGGAAAAAAAM 

CAACCAAAAGGGTTCNCNCCCAATTCCCAACCATANAAAAAAAAANATAGATANACACAN 

AAAGGGGAAAAGGTTACCCCTT 

Sequence 1473 

AGGTACTGGTGTGTCCGGAATCCTACCCACTGTGATGACAGTGCCTGATAGTTTCTTCTG 

CCTTTCTATCCCAAAACGATTGGTCAGTTTACCCMGGTTTGGCAAATGGCCAGCCTTNA 

GNAMTCTTCCCCAGGGGMNCAATNCTTCTTTTCTTAGGTTAAGNTTTGGCCCCTTTAA 

GNCCCAATTCCTTTTNGGTTAAGGTTTTGGNMTNTTTGGMCCTNTNTTTNTT^ 

NTNTNTTGTGGCCTTTTCCCCAGGAAAMGGCCTTCTTMTGNCCTTTCMTTAATGGGG 

AACCTTNGGCCANTAACCCCAAAA I I I I I I I I I I I I NGGGTTTTNCAATTTCCTTGGGTT 

TTTGGGGGTTNCMTGGMTGGGGTTTAAGGCCAAGAAAGGCCCTTGGANCCCCTTCCCC 

NTGGGTTTTAACCAAMTTMGGAAAAATTGGAMTTCCGGGGTTTTTCCCCTTGGGGGG 

CCTTAACCAAGGAACCTTTTGGAAGGGTTCCTNGGTN I I I I I I I I I I I I I GGG I MUM 

TTTTAAAAAAANCCCCTNNTNCNCCCCCAAATNGGGTTGGGGGGCCNAAAAA 

Sequence 1474 

CGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTC I I M M M M M M I GNGGATGGGGAC 
TTGTGAATTTTTCTAAAGGNGCTATTTAACATGGGAGGAGAGCCGTGTTGCCGGCTCCAG 
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CCCAGCCCCGCNTGCTTCACNTTTCCACCNCTTTT^^IX^CTCACCTTGCCTTCTTGGGCT 

TTTCTNCAAGGGCCCCTNNTGGCCTTTCTTCCCCGNACCCCTTTGGTTCTTCNCTTCTGG 

AAANAACCCNCTTCNCTNCCAACCAAACCTNNGCAGNCTCCCATTNNCTTTCCCCGGGCT 

TCCCCCNTCNCCTTAAGGTTCTTGGTTCCCCTTGGCCGGTTCNCCTTCTTGGTTCCCCCG 

GGGTTTTTCAAGNAGNAACAAANCNTTNCCCCAAAAANGCCAACAAAAAAGGCCAAGGTT 

I I 11 I I I ICCCCCCCCTTAAAAGNGGGNNTGGGGGGGAAGGGGGAAAAGNCCAAAAAAAA 

GGAACCTTCTTGGTACCCCTTTTTNGGGGCCCCGGGCTTCTTAAAGAAAACCTAAGGGTG 

GGGGAAATNCCCCCCCCCCG 

Sequence 1475 

AGGTACTTCAATCCTGAATTMTCTTTAAACACTTTCAAATATGGAGATTAATCACCAAC 

TTCTTAI MM I GGGCCAGTTGGATTCAA I Jill I ATTTAACCATGGANCTCTTTCNTAT 

TATTCGGTTACCTNGTNCGGAMTGGCCTAGGAAAGNAAAGGGCCTTCTTTTTCNCGGAG 

GGGTTNCGGGGAACCAAGNCNAGGAACCTTTTTGGTNAA I MM I AATTGGTTTCCCCTT 

TTTTTTGGNTGGGGAATTTCCTTTAAATNGAACCCCCAGGTTCCGGGAACAAGGGAATTG 

NAGGCCCAAAGGGTTCAAAAAAGGGGGAAAACCNTTTTTNTNAAACCCCCCAAGGGTATT 

GGCAAGTTTGGGACCACCATTAATTAA I M M M M I I CCTTTGGGGAAAAGGGAATGGT 

TAATTTTTTCCCCCCAAAATTCCAAAANCCCCTGGGGTGGGGAAATTAAAGGGAAMTTT 

TCNTGGAAAAAACCAACCCAATTCTTAA I M M I IGGAAAAAGGCCTTAAl M M M M I 

TGGGAATAACCAACCC 

Sequence 1476 

AGGTACAAAATTTTATTAAAGGTCTTTAGAGAGCAACATCCAGACTCCAGAATACAGCTG 
NGNAGGAGACCCTGTTATGCTGTGGGGACTGGGCTGGGGCATTGGAAAAGCNCCTCCNTC 
TGGGCCTTCCCAACNCCCTTTANTTGTTTCTTGNAAGAAATGGGNGGGGNTTTGNTTGGG 
NGNCNATGATNATTANCTTTCAATTCTTTTTTAGGGGGGATTTTNACCAACCAAAAATTG 
NCCTTTCNAAACCGGTTGNGGNNTTCCATGTGNCCAAGGGNGGNGGCCTTTTCI M M lA 
NGGGGGNCCCCCAAANTTCNTTTTNMCCCCCAANrrANTTTTGN M M I CCGGNNAANGT 
CCCGGTGTTTNTACCCGGAAAATTCTCCGGNGGGGGNANCAAANGGNGNNTTTNCCTTGG 
TTG NGTATTNCCCCAA ACTAATTTAAAAAAAA AAAC 
Sequence 1477 

CGGCCGCCCGGGCAGGTAC I I M M M M M I M M M M I I GATGAAATGCCAGGGTGA 

AAGGGATAGCCAAATAGGCTAAAGCACAAGTGCCACTCTAGGTTAATTTCGGCAGGACNN 

GCCCCAAATAAAAGGGTCCCACCNGACCANNTTACCCAATNCAACAACCATTTCCCGGNT 

TTCGGGG 

Sequence 1478 

AGGTACCTTAACTTGAGTTACAGGGCTGGTCCCTCTCTTTTCAI I M I I ATCCCAGTAGG 

TGAGACCGTCCCTGCTGTGTTCGGTGGCTGTGGAGTTGATGGCATCTTTGCTCCAGGTGA 

CACCTGCATGCTGGCCAGCTGAGCGGCATAGAGCTGCTGCAAAACAGGGAAGACAAACAT 

TGATCTCTTTAMTGCAGCTGAMTTGAGTTTCAAMGAAAMACTTACCATATTGTTAG 

ATTTCTCAGACAGCCTTGTMmCTTTTTCACTATTTTTA 

GGCAAATAATAAAACAGAAAGAAAAAGGACAGGTATGGGAGCCATAGTTCTGTTTCTGGT 
CCTTCTAGC AACMCTTTATGATCCTGGACMAGGATTTGATCTCTTTAGAATTCAAGTT 
I i I 1 1 I I I I A 
Sequence 1479 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAACACCTGCTAAAGACATAGC 
TTAATAAATATTTGTTGMTGTTGTTGMTGAAGAGCATTTTCCTGCTCATAMTTTCTC 
ATTACTTAACGAATGTAGAATAAATTCCTTACTCTGGCAGACTGGAATATGTAGTTACAG 
TTTATTCTAGMTTTTTGCATTC^ 

CCTAAGTGACCCAGGTGAGTCCAGTGTCCTGCTGATTTCACTGCAGGGCAGGCATGTGAT 
CCAGGCTGGCCACAGTGGTCAGTGTGGACAACTGACCCTTGTTCATCTGTGGAGTCTAAC 
AAAAGTAGTGCCCCCTTATCCTCGGTGGTTATGTTCCAAGATGCCCAAGTGGATGCCTGA 
AACCGTGGATAGTACCTGCCCGGGCGGNCGNTCTAAAACTAAGTG 
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Sequence 1480 

CCGCGGTGGCGGCCGAGGTACTTAGCATTGATCAAAGAMTTTCAAATTACCGATCAATT 

GGGTGGGGAGAGGAATTTTCATTGTCCAAGCACCCTCAGGGAACAGAAGTCAAAGCAATA 

ACATATTCAGCMTGCAGGTCTATMTGMGAGMCCCGGMGTTTTTGTGATCATTGAC 

ATTTMGACACCAAAAMTAAMGACTCCTACGMGMCTGTTTTTGTTTTCCTCTTCCT 

TTTGAGAAGACACTATGAATTAAATTCTACAGC I I I I I I I I GATATATGGAAATTTGTAG 

MCAGAMTATTTTAGTTAAAGTGTGACTTTCAGAAAGGGAAAATCAGGGCACAGCCTTG 

GTCTGTGTTCCCCAAATATTCACACTTTAAAGAATTCTTAAC 

Sequence 1481 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGATTGTCCTTTATGCATTAAATTCAT 
GCTTTTACATACTTGTGTTTCCTTTTCATTTC 

TAGAGAAAAGGGTTGGCCAGGTGCAGTGACTCACACCTGCAATCCCAGCACTTTGGGAGG 

TCGAGGTGGGTGGATCACTTGAGGCTAGGATTTTGAGACCAGCCTGACCAACATGGAGAA 

ACCCCATCTCTACTGAAAATACAAAATTACCCGGGTGTGGTGGCGCATGCCTGTAACCTC 

AGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTTGAATCTGGGAGGCGGAGGTTGTGGTG 

AGCCAAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTCTCAAAA 

AAATTAAATAAATAAGGCTTATGTGCAGTTATTCTCAATGAGCTAAAAAGCTTCCTGAGG 

CTAGAMTTCTTGTCATTTTTGGCTGGGA 

Sequence 1482 

CCGCGGTGGCGGCCGCCGGGCAGGTACTATCTATTTGGCCTTGGAAAGTTACTCTCTAAG 

TCAGGGCTTCTTTTCTTNAACTTTGGTGTGAGACTCCGCTGGAGAGCCGGTTAGAAAAAC 

ACAAGTCTCGGGCCCCACCCCGCAGAGCCCCTCATTCTCTAGTTCTGGGTTGGGGCCCAG 

GAGTCTGCTTTTCTAGCAAGCGCCCAGATGTCACTGATGCTTACAGCTCTCANACCACAG 

TTGGAGCAGNGATTTTTAAAAGTCTmCATTTGTAMGAGTNGTTCTCCATC 

TGACTGNGACGACTGAGAAAATGCATGTATGTAAAGTCTGCANCTGGTGACATTGTACAC 

ACTNAGCAAATGGCCTTNGGTGTTACTGTNANTTTA I I I I I ACTAATTATTTNTNNCACC 

NACAAATTNGGANCTGCTNCAATCGGTNGGAATTTGGAAATTGGGC 

Sequence 1483 

GCTGCCATCAGCTCCCTAGGAGCTCTCCCTCCAGGAAGGGAATGTGTCCACCGTCAGACA 

CTCAGACCCAGCATGTGGGGACAGAGGCTGATGGCCTGTCTGGCCATTCCTCTCAGTTCC 

TCTCCTCACTAtaCTTGTGTCCTTGTGCAAGTCACTTACCCTCTCTGAGGTTCAGTTCCCT 

CCTCTTTGAAGTGGGTTTAATAATAAGTACCTGCCG 

Sequence 1484 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCTTTCGTTAGAAGGCAGTGACTCCTTTC 

TGTGAAGCCGATTTAGTGAAGTGTCCTGTGCAGAAAAGAGTCCAGGGCTGTCAGTTAATT 

TCTTCCGGCCACTGGAGTTAGGGTTTGAAACTCTGCAGCTGCCTATTGCACTTGTGAAAA 

GGTTTGTATGTTCATCACTGCTGGCTGGCTCAGAGTTGGGAGTGAATCCTCCAAGGGATA 

AGCTTGGAGAACTTTCTGAACAGTCAATCTGTAAAGGTGTCTGCAATCCCAAGGCCAATG 

GACTAGATTCTGAAGGCTCTCGGTGGACCCACTGTTCCTCTCTGTTTATTAAGCl MU G 

MGGAGAGAGATGAGGGCAGGACATGTGACMCGGTGCTTTTCCTTATGCTTATATCGCT 

CTCC AAC AG C ATCCTT 

Sequence 1485 

AGGTACATGCAAGTTGCATGATTATAATGACGTGATCCTGGGATTTAAGTTGATTATGAC 

AGGAACAGAAGGAACTTGGAGATTAGGGACAATGAAAAGGTGGTAGTGAAAGGAGTGTTG 

GAGTTAAGTTACTAGATGTCTGGAAGACAGACTGTGGTGGTCAGATAATGAGATAGATTA 

TGGAGGGGTTACAGTTTTTGGTAATAACGAGAGAGATCTAAGGTATGACTGAGAGTGAAT 

GGATGAGAATAGCAGAGAACAAGGTCAGTGGAAAACTACTCTTCAAAGAACCTATAGTCA 

GGGTGTTGAAAGATTAAAAATTGCTAAGAATTAAATCAGGAAGTAGTGCTGCGGGAAATG 

AAATGAGCAGTGAACTAAACTTACTGAGAAATGAGAGGGGATGACCCAAGGGTTTGTAGA 

TTTTGTAGATGATAGCA 

Sequence 1486 
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AGGTACATGACATCCGACGAAATGAGACGCCACCTAATTGATTTCCGGGAGTCCGCACCA 

GGGGCCCTCAGGGAAGAGACCCCGCAAAGATCCTGGGAGACCAAGGTGGGGACCCTGGTG 

AGAAGAGAGAGTTCAGGGGAGTCTCTCTTCATTGCCCTTCTGCTAACCCAAGCATTAATT 

TGCTAAGTATTTACCAGGGGAGTGGGAAAAAGAGTTGAGCGGGATTCTCTTAGGCTATGA 

GAGAGTCAGGCAGCCCCCAAGATAAAATAATGAACTAGAAAATCTGGAACCTTACTTCTC 

TGGGAATNTTACCTATCTGGCACCGTGGGAAGAAAGAAAAAAGGCTACTGAGTACCTGCC 

CG 

Sequence 1487 

GCGGTGGCGGCCGAGGTACTGACCTCTGCCAGTCAGCTGCGTAACTCCAGGCCCTAGGGT 

GCCCCGTCTGTCCAGCCAGGGATTGCATGGATATGCTGTGATCTCCCTTTTGGTTCTGAT 

TGAGTTGGACCTTGTGGGAGGAGAAACATAGATGTTGATACATGAACACATATGTTGGAG 

AGAGMAGTTTTATCTTGGCATAGGACTTrTAAAACACAAGGTAATT^ 

GGGACCAAAAAACACTCAATATGGGAAAAATCCAAATTCTGCCAAAATGTCTAAAGAGGT 

TTATTCTGAACCAGTATAAGTGACTGTGGTCTAGGTTACACAATTTCAAGAGATCCTGAT 

AMGCGTGCATGAGAGAGTTGGGCTACAGCTTGGTTTTACACATTTCAGGGAGACAGGAA 

TTGTANGGTAAAATTATGGCACAGGACTTTTAAAATGAAGCTGTGAAAGTTTACAGTCCA 

TAGAGAATAAAAATCTAGAAGTTT 

Sequence 1488 

CCGCGGTGGCGGCCGAGGTACTTGACACGACATATGGTAAATGAATAAGACAAAGGCTCT 

GATGGCTTCTCACAGCCCTGTGGATCAMCTCAGATTCTTTTCTAGAACCCCAAGGCCCT 

GTCAG TCTCA CTAGCCTCTCTCCAGACACACCAGCCTTTTTCTACCTTTCCAACATCCCA 

AGTTCCTTTTCTCTTACAGGACTACATACACTCTCTCTTCTGCCAGAAACCATGTTCTAC 

CAGCTMTTTCCACTCAACTTTTAAGTCTCANCTGAAATGTTACTTCCAAAGAGAGGCCT 

CCACTGAACCCCAAGCCTGGGGTTCACAGCACCTGTCTCCATAACTACATAATAATCTCT 

CTGATGTTTAGGCTGGGCATGGTGGCTCACGCCTGTAATGCCAGCACTTTGGGAGGCCAA 

GGCG 

Sequence 1489 

GGGCGAATTGGAGCTCCCCGNGGTGGCGGCCGGGCAGGTACCCAGAGTTGCGAGGAGTTT 

TTTMCTGATTTAGCCAGGTGGCAANCATNAGTGAATGGATGAAGAAAGGCCCCTTAGAA 

TGGCAAGATTACATTTACAAAGAGGTCCGAGTGACAGCCAGTGAGAAGAATGAGTATAAA 

GGATGGGTTTTAACTACAGACCCAGTCTCTGCCMTATTGTCCTTGTGAACTTCCTTGAA 

GATGGCAGCATGTCTGTGACCGGAATTATGGGACATGCTGTGCAGACTGTTGAAACTATG 

A 

Sequence 1490 

GATNAGCTCGATATNGAATNNCNNCANNCNNGGGGANNCANNNGNACAAGAGCGGANNCC 

NNCGCAGAGGAGCTNCAANTTTACACACTGTTTAAATGAGGGAATANGCNGCAGCGCTTG 

GATGTAACTGAAGAAGACAGTNNGAGCNNCAAGGAGGGACAACCACGACCTATGAGGACA 

CCATGCCAGAGAGGCCTGGACCCCACGCTAGGCTCAGTGCCTGTTATACTCTTGGGACCC 

AGCGCTTTCCTTCTCCATCACGTGGCATACTTGGCATTATTTGTTGNTTAAAATATTGCC 

CTTAGTTTTNACCTTTCNTMGGAGACACMGGNNGACCTTTGNGACATTAACAGTTGCC 

CC AAGTN GGGGGNANANAMNMTTTTTGGGGGNGNAM 

AAAI I I I I I I I I NAAANTTT 

Sequence 1491 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTGACCCTTGCTGGT 

GCTTTCATGTTTTGTGCCGAGGCMTTAGACTTTGTGCTGMTNTGTTTGTGCTGCCACC 

TCAGGGAAGGGGTGGAATGTGCAGCGTGGTTTCCATTTGACATTGTTTTCCCTGAGAGAT 

GGGAGGGCTGAACGTTACCTCTTGACAAGTCTTAGTGGACAGAGGGGCCCGGATACCCAA 

GCGCCTTAGTTCTTAGGGCTGGGTATTAGTTCATTTTCACACTGCTGATAAAGACATACT 

CGATACTGGGAAGAAAAAGAGGTTTAATTGGACTCCCACATGGCT 

Sequence 1492 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTACCCTTTACTTTTTCCC 
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CAAGACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACC 

CACCTCTCCCAAGAGCAGACATTAAACATCTTTGTGCTTTGAAGAGAGTGMTTTTGGAT 

AGTCTTGTGATTCTCAGACTAACTTCCAGAATTATACTTTAACCCCTTCCAGATATGGTC 

CGCCTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTA 

ATTGTCMCCTCCAGTGTTGACTCTAGAMTATGAGGAAAGCTTTTCAGTTTTTAAAATT 

GCCATTTAAATTTAGTCTATTMMACAMCCTAGAGGTCTTGGTGCAGTTGATTTCAGA 

GTTTATTMTTTMGTGGTCCCMMGTATTACATCTTTTATATTCTGGAAGAAAAGAAC 

TGTGAACAAATTAGA 

Sequence 1493 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACTGTGGAGGT 

CCCAGGAGTGCTGGTGATGGGCACAAAGTTCCGATGGGTGAAACCATTGACATAGAGACT 

GCCTCTGTCCAGGAGGNANGGGCCCAGCTCGATGATGCCATGGGTCAGTTTGCTCAGCTC 

CCAGTATAGCTGCTCTCTGTCCAGTCCAGGGTTTAGAGGGTCAAGGCGGTGAGTGCAGAT 

GGTGTCCACGCCGGTGGCTGNCCCACGTTTTTCAGGCCTGAGCAAGGTCAGTNTGCAGCC 

AGAGTACCTT 

Sequence 1494 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTTGAATTTGTAATGA 

GTCTGATGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGAATATC 

TTATATCCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATTATTATGAC 

AACTG AACATGCTTATGCATCCCTCATGCCTTTA I fill I ATTTTGGGAAATCTTTCAGC 

TTCAGTTTTTGCTGATATTTATGTGATTCTTTGTTCTGCAATTCAAATTTCTGGGAG 

AACAGTCTCCTTGGTTCAGATTACTG I I I I I I GACTAGAGCTTCTCGCTTCAGATTCTGT 

CATMGATTATGGCTTMCCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACA 

TTATTT 

Sequence 1495 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGCGGGGAGTTTCACGCGCGTATGC 
TTTGCCCGCCATGGCCGCAGTCMGGNGCCTTTGGAGTTCCATGCCAAGCGGTCTTGGCG 
CCCCGAGGAGGCAGTAGAAGATCCGGACGAGGAGGATGAGGATAATACTAGTGAAGCCGA 
GAATGGGTTCTCCCTGGAGGAAAGTGTTACGGCTCGGAGGCACCAAGCAAGGATACCTTA 
TGCTGGCTACTTTGGATGAGAATGAGGGAAGTGATAGATGGAGGGCAAAAAAAGGAGCAA 
TCCGATTGACCTTTNACCAAGGGGGANTTTTGGAAGC I I I I 1 1 I I I IAAAAAI I I I I NTT 
TTTNGGNGAAATTTTTNNAAAAACI II I I I I I NTTTNAAANNANATTACCCCCCTTTTAA 
AAAAAAAAAATTTNCCCCCCAAAAAAAAAANGGGGNAAAANANCCCCCCAAAAANAAAAN 
ATTAAAAAAACCCCCCCCAAANGNTTCAAGGGGCCC I I I I 1 1 I I I I I I NGGGGNAAAAAA 
AAAAMTTNGGCCCCCNCCNCNTTTTTTTGGGAGGAGAGGGGGGNCCCCCNCCCCCTTTA 
AAAAAAAAAAAGGGCCCCCCCGGGGGGGGGGGGNANTTTTANAAANI 1 1 I I I I I ICCCCC 
CCCC 

Sequence 1496 

CGGGCAGGACCATGGGAAATAAGAGCNGGCTNNNGGCATTCTGNGTANGGAGCCTGAGCC 

MACTCTAAAGCTGTCTTTATAMGGGAGGTCATGTGATGGCCAGAAATTGCCTTTGCTT 

CATGGTGCACTTGGTGGGGAGTCAGGTGTGGGGTGCTGGGTTTCACATCATCCCATTTTC 

TTTTNNGNNTTCAGACCTGCAATGCTTCTTTTGCAACCCGAGACCGTCTGCGCTCCCACC 

TGGCCTGTCATGAAGACAAGGTGCCCTGCCAGGTGTGTGGGAAGTACCT 

Sequence 1497 

AGGTACTTTTNGAAGTAACTGGACATGNGGGAGGNNAGGGGAANGGAAGTATTGNTATGG 

ACTGAACTGTGCCCCCAAAATTCATATGTNGAAGCCNTGAGCTCTGACATGATTGNATNT 

GAAGTCCTAAAGCCAGGAATGAGGAAGGCTGTGAATGTNCATTGTTCCATGCAAGAATGA 

CTCTGGNGNGGGCTATTTAGAGATCATGAGGGATACTGCCCCAGTTTCCACAGGCCAGAT 

GGNCTCCAACAAAAGCCACGGGGGAGTCACCCCTGCCTGGCAGATCTATCGGGTCAGGAC 

CACCGCCCAGGGGGTCCTGGAGAGGACAGTATAGGACCAAAGAGGAT 

Sequence 1498 



m 
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CACCGNGGTGGCGCCCGAGGTACTTGGCCAAGCGCTCAGATCGGCAAGGGGCACCAAGTC 

TTGATCTGCCCAGTGCACAGNCCCACAACCAGGTCAGCGATGAAGGTATCTTCAGTCTCC 

CCCGAACGATGAGACACCATGACGCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGA 

A GAGACTCG GTCACCGGAGCCATTCTGGNTTGACTTTTTAGCAAGAAGGNANTTNAAGGA 

Nl I I II I n I ANCGNCTTNGGNAATCCTTTTGGGGGTGGGGCNCNTGGGNGAAAAACTCC 

CCCCCANCCNTNGNGATTCNTTNCNNGNGGGNNGAAAAATT7TTTGGNNAAAANTNNCCC 

ATMNNTTTTTNGTGMNAANGGANTTTTTTNANAMANCCCCCTGGGGTNTNTNTTTTN 

NNNAAAAANATTTTTTNTCTTCNCNGGGGGGGNCNNTTTTATAAAAAAAANGANNANACC 

CCCCCNCCGTGGNGGAGAAAATTNTTTTANATTA I I I I I I 1 1 CNTCCCCCCCCCGGGCCT 

TGGGGGGGGGGGCCCCCCANCCC I I I I I I I II I I I I I I 

Sequence 1499 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTAA 

TTTGTGT CATC CCATCAGCAATGAAGGTCCCTATCCAGGGTCCTGCTTGGAGCAGCATTT 

CATGTTCTTTTGCTGTTTTGTGCTTTGCCGATTTTGGATTTTA 

ATTTAAAAAACTCGTCACCTTTTGGAAATGCCCATTGCCGACTTGAA 1 1 I I I 1 1 GTATGA 

AGTCCCTCCTGATTT TGTGTGTGTGTGTCTGTGTTTAAGCAAGCGTTCGGTTGGTATAGN 

I I I I 1 1 I I GTTTTTTTMTTTAAATTGAAGGTAGCTGCCTCCTGAAAGCCAGCATTAAGC 

CAGAACACCCAGNTTCAAGCAAAAGACCCACCTCTCTGCAGAGGCAAAGTCTACTTTCTG 

GTACCT 

Sequence 1500 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAGTAGAGACGGGGTT 

TCACCATGTTTGCCAGGCTGGTCTCGAACCCCTGGCCTCAAGTGATCCACCCGCCGCAGA 

CTCCCAGAGTGCTGGGATTTCAGGTGTGAGCCACTATGCCCGGCCTMTACGTGGATTTT 

TAAAGCTTCAGGTTCTGGTTCAGMGTTTCCTGGGTCTCATTAAAATAATGAGGCACTCA 

GAATTGGTCTAATAAAAATMCGACCATTTCTTTCTACTTCAGCTNTTTCACAAACTTTT 

TTAATGAAAATGACAAGNGAGGNCCTTCAGTAGGGGCATTTTCAGGGGANAAAAATAGCG 

GGNA GACCTGAMCCTGGGNTAGGNAGTTTNTTTTTATTTTTTGAACAAGMGANNAATT 

TTTTCANAAGACCCTAAAAMTTNTTTTCCCAA 

GGNGGGACCCCCTTCCTTTTGGGNNTTTTNCCTTCCCCCCT 

Sequence 1501 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACTGACCACAGCTCTTTCTTCAGGGACAGACATGGCTCAGCGGATGACAACACAG 

CTGCTGCTCCTTCTAGTGTGGGTGGCTGTAGTAGGGGAGGCTCAGACAAGGATTGCATGG 

GCCAGGACTGAGCTTNTNAATGTNTGCATGACCCCCAAGCCCCACANGGAAAACCCCGCC 

CCCGNGGACMTTTGTTTTNACCANGTTTCCCCCNTTGGNNNNAAAAAATTCC I II I I II 

TTTNCCNNCCCCCCCNCGGNGNNCCCNMGGGGGNNTTTTCCCCNNTTTTTTNANTNNNC 

CCCCCCCCCCCCCGGGGGNGGGGNGGNCNCCCCNCTTTTTNNNAANNANI I I I I I I I INT 

TNNAAAAANCCNCNNNTTTTTTANAANGGGGNTCCCCCNNAATNTNGGGGCCCNTTATCC 

CCNGGGGGNTAAMNAMNTNTTNNCCMAAAGGGGGGNCCCCCTCCCCCTTTTTNAAAA 

NAGGGGGCCCCCCCCGNGGGGGGNNGNGAANTNATAAAANI I I I I I I ICNCCCCCCC 

Sequence 1502 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCCAGCG 

CAGTCTGGGCCCCAAAGCAGTTTCACCCTTACCTAACTTACCTAACTCTTAATATAGCCT 

MACTCACTGAAAMTAAGCTMCTTCATTTCACCTTTNGTAGCATACACGTAGACTCAG 

AGTNTATACTGAAATATAATGGGAGGCTGTTATNAAAAAAAACCNNCCCTCNAAAAAAAA 

AA AATNTNNN CNNCTNAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGGGGGGGGGNNNNG 

GG I T I I I n rNATNNTAAAAAMMMTNNGGGGGGGGGGGGGGGGGGGGGGGTTTTTTN 

TNNTNTNTTCCCCTNNANTCCGGGGGGGGGGGGGGGGGGGGTNGTNGGTTTCCTTNTATT 

NTCTNTNNCTANNNCACNCCTCCNANACCCGCNNNNGANNNNAANATNNNTTNNAATTAT 

NNNNATTTNNNAAAAAGNGGGGGGGGGGGGGGGGGGGGGGTTNCTTTCNNTTTNNANTAT 

AANTANNATAGTNTCTNNNGNANNNTTNTNTNNNNANATNTNGGNNTNCNCNCNTNCNTN 
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CNTCTTNTTTATAAAANAAAAAAA 
Sequence 1503 

CCCTCGCGGTGGCGGGCGAGGTGCATCACCCTGCTGAGGGACATCCAGGACAAGGTCACC 

ACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTG 

ACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGC 

CTGGTGGAGCMGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCC 

ACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACA 

AGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCC 

CAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGAT 

GCGCTCAACCAACTCTTCGAAACAGC 

Sequence 1504 

CCGGGCAGGTACTCTTGACAAAGCCTCCCCAACAAGGTGAAATCACTCTCGGCTCACCAC 

TGGTGGCAGAAATACTCTGGGCCTGTCTGTTCTGAATGGTCAGTTTCACTGTAATCCTCA 

GGCCCTTCCAGTCACCCGTTGCC7TGGCMTGTCATCACCAACTTTTTTTGGAGACAGAC 

CCAGGGGGCCCGATCTTGGGGGCCAGGGCAGAAGTTGGCACCGACTTTNCCTTCCGGGNN 

CCCCTMGGTTAAAA^CTTTTNTTTTTTTGGGGG 

GGGGCCCTTTNNTTTTTAMNNAAACCCCCNNTTTTNGGGAAAAACCCNAAAGAAAAGTN 
CCCC CTTGGGCCCCCCCCCNNCCCAAAAGGCCGNNAAAAACCCCCGGTTTTTTNGGNGTT 
TTTTAAAAAAAAAGGAAANCCCCCCGCNNCGGGAAAA^^^TTN^ 
CCCCCCCCNCGGGGGGGGGGCCCGGCCNCNN I I 1 1 M I I 1 1 1 I I I I I I 
Sequence 1505 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGTGGTAA 

ATGCTTAACTCCACCTATATAAAAATCAAAAACAGGCAAAATGAATCATCAGACTACAGC 

TTACCTTTAGGGGGTGTGAGTAGTGACTAACAGGGGTCAAGATGAAGGTTTAAGGATGTC 

GCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGAT 

CACTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACTAA 

AAATACAAAAAATTAGCCGGGCGTGGNGGTAGGCGTCTGTNGTCCCAGCTACTCGGGAGG 

CTGAGGCAGGAGAATGACGTGAACCCAGNAGGTGGAACTTGCAGTGAGCCGA 

Sequence 1506 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTGCATTTATTTTAGCCCATGGAATAA 

C TGTG CTGAGAAACCACAGAGTCAATCAGATTCAAAATGTTAAAATCCTTCCTGCTTGGA 

GTTTTCCGTCTTCACATCAMGCATTTCATGCCCGTCAGCAACTTTTTAATGCATTTGCT 

CCTCGTTTGCACAATTTCCATTTAAAGACTTTCCTTGGCTGACTTCTCTGATGAGGTTTC 

CTGCTTGCCAGGAGAGCACGCTAATGCAGAAATTACAAAGGGGGCTTCACGTCCCTTTTC 

CGGAGGACCTGATATTTCAGATMTTTCCAGCTTCAGTTTTGGAGAAACGACTGTTCTTT 

GCACCAGGGGAAMTAAACTGATTTTCAGTGTAAGCAACCTTTCTGCAAGTAGAATGGGG 

ACTGTTGGGMTGGAGATGAAGACTTCACTTCATGTTATTCATTTCCTTAC 

Sequence 1507 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I II I 1 1 1 I I I I I 1 1 i I G 

GTTAGGTTMCAGAMCAATGGGGTTATTTCCGGTAGTGGCMCAGCACGAACTAGTTTT 

CTTGTGGACTMTGGCCCGAGAGTTCCTGCGATTTGTGGATTATTTCCCTTAGATGCAAT 

CGACCATTTCGTTCCAATAAGAATTAATGCTACCACGGGGCTTACAAAGAACCCCTCAAG 

TCTTCCAAATCTGCCCATGACATCAACCTNTGCTGCGTAATCGGACCTGCACCCAACCCA 

GGTTT 

Sequence 1508 

CCGCGGTGGCGGCCGAGGTACGCGGGAGAACAGCTCAGAAGGAGACCCACAGTGAGCAGC 

TCCCCTGTGTCGGCGGGCAGGTCGTCCCTCAAGTGTTCAGCTCTCAGCAGAGAAAAGGCC 

CTGGAGAGGGTGACTCCTCTCAGCTCTCAGCAGAGAAGCAGCCCTGGAGAAGGTAGCTTC 

TGTTCGCAGGCAGATTGTCCAGAGGTCCTGCTGCTCTCAGACGGGGCCCTGGAGAGGATA 

GCTTCTATCCATAGGCAGG 

Sequence 1 509 
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CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAGCA 

GTGGAAGCTGCAGAGGGCCGAAACAAAAATGAAGTTTTCTACCAATGTCCAGACCAAATG 

GCTCGAAATCCAGCTGCTATTGACATGTTTATTATAGGTGCrrACTTTTACTGACTGGTTT 

ACCTCTTATGTCAAAAATGTTGTATCAGGTGGCTTCCCCATCATCAGAGACCMATTTTC 

AGATATGTTCACGATCCAGAATGTGTAGCAACAACTGGG 

Sequence 1510 

CTCCCGCGGTGGCGGCCGCCGGGCNGGAC I I I II 1 1 I I I 1 1! I I I II I I I M I IAAAGAT 

CCAGACTAAGACACATCAACAAGAAATTTCCAAATACCAGGTCAAGAATACTTCACATGT 

TTCTGGAGGGAAAGAAAACAGTTCATATACAGNGAATCAGGAATTACAATTGCATCAGAC 

TTATCAACAGCATCGGGAAGATAATGNGAAAATACCTTCAAAACTCTNGAGGGACACAAG 

AAGATGCTGCCTGGCCACAGGAGCGAGACTGCTGGCCTCCAATCAGTCTTNTGGGCAGGC 

CTNTGATGCAATTACAGGGCTAAGGAACCACGGTAACCAATGTGCCATTGGCTGTGCTAA 

AACCCAGNGGCCCCAGGAAGAATCCTGTTGACACTGGCTTGGAACATGAGGCTTAA 

Sequence 1511 

CGACTACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTTTT 
I I M I 1 1 M I M J 1 1 1 II II M I GGCATATAATCCAGTTTATTAAATACAGATGATGGGC 
CAGACATGGTGATAGAGAMTACAGATTMGAMCCAGATCAAATCCTTTTTAAGGAATT 
ATNTAGNGGAAAATATNTCMCTNTNTTNTT^ 

AATCTTCACCTTTCCATTTTGACNGNCGCTNTTTTACTTCAGNNTCCTGAAMCATNTTT 
CCAACAGAAGTTACATAAAMTNCTAATCTTCAAGGGGCTTTCTAAMTATTTTNATCAC 
GTCATTAAAATCTTCTTTCACTAGGCMNGGTTNTGTCTTTATGGGGGCTG 
Sequence 1512 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATAT 

TGAGGATGCGCTCAACCMCTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTG 

TCAAGTTTCAACATTCAGGTCTGTCCCAACAGGCACCACACCCGGGGGTGGACTCCCTGT 

GTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATCTATGAGGGAATTTCTG 

CGGATGACCGGAAATGGGTACCTCGGCCGCTCTANAACTAGGTGGGATCCCCCCGGGCTG 

NAAGGAATTTCGATATCNAGCTTATCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCGG 

GTCCCAGCTTTTTGGTCC 

Sequence 1513 

NGGGGCCGCCGCCCCCGGGGCCGGNCAAAGCAANGGGGGGCCAACNAAGGCNNGGNNNCC 

CCNGGGGGGNANGGAAAAACGNGGGNAANCCCCGCGGCAACCAAAAGGGGNCCAACAACA 

AAACAAGAACCGGGAGGCCCCGGGGGAGNCCANAAAAAANGGCGGCANAAAAGNCCCNGG 

GGGGGGGGGGGCCCCNAAAAAAGGAGGGNGGGANGCCAAAAACCAGCNACNAANCAAAAN 

GCGGCCCGGGGGGGCCGCCCCCAACCNGGGCCCCAGCCTTAAAGNCCCAAGAANCCGCGG 

GGAAAAAACCCCNGGGNCCGGGGGGCCCCAAGCCCNGGGCCAANNAAAAAANGGGAAAAA 

NNCGGGGCCCCCAACCCGCCCGCCCGNGNGGGGAAAAGGAAGGGGGGCCGGGNNCNCCGC 

CCGGNAANNCCGGGGGGGCCGGCCCNCCGGNGCCCCGCCNNNCCCCNNCGGCGGGCNAAC 

CNGGGAACCNGCCGGCCNGGCCGGCCCNCCCGGGGGGNCCCGGNNNACCGGGANGNGGNC 

CGGGCCGAAGGCCGGGGNANGCAAGCCGNCNAACCNCCAAAAAAGGGGC 

Sequence 1514 

GTNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCTTNTAAATCTGGTGTGTTT 

TTTACACTTGCAGCACATCTCAAATGAAATAGCAACATTTCACATGCACAAGACTCACAT 

ATCTTCTGATGGCTATCTTGTCGAATAGCACAATTCTAGGAAGTTCAAAAGGGGGAACAA 

TTGCTTCTCAAATTGATGAAGGTAGAGAAGAATGGTATCAGAGAAACTGATTAAGCAGGG 

ACCTGGGAATCTGAGTTGAGTTCCGGGGCATGTTTGTTCCCTAGCACCATGACCTACATG 

GCAGTGTCTCTTCTCATACTATTNTAAGAGTCCCTGG 

Sequence 1515 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGCCAAGC 
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TGGACTGCATAAAGATTGGTATGGCCTTAGCTCTTAGCCAAACACCTTCCTGACACCATG 

AGGGCCAGCAGCTTCTTGATCGTGGTGGTGTTCCTCATCGCTGGGATGCTGGTTCTAGAG 

GCAGCTGTCACGGGAGTTCCTGTTAAAGGTCAAGACCCTGTCAAAGGCCGTGCTCCATTC 

AATGGACAAGATCGAGCGGCCGCGGGCAGGTACTTCATGAGACAAAATGGAAAAAGGAGA 

ATTAMTTAAAATGCACAACTMTATTTATCTACTACAGACATAATATTTCTCAGTTGTG 

AACTMTTACTATGCTTGGAAMTGCTAGCATCCTCATAAATATTTTGGTTCTATTGGGA 

TACAAMTCTGATTTCGCAMCTTTGCAAAGGCACATTTT 

Sequence 1516 

CCGGGCACGGTACGCGGGGGCATTTATCTTCACTCTGATGAGGGCTCAGACTTGATAACG 

CCCGTGGTGCCCCATCCCTATAGGAGCTGGTGAGATTGCAGCCTGCTGCCTCCCCTCCAT 

CAGCCACAGC TATT GGATTTCCCACCCAGMTCTTTAGGTAAATGAGGTAAGTCCTGATT 

TTTAAMCTTCTTTTGAATCTGGAATCCAAACACCTGAGTGGAAAGAGAAGCCTGCTTTA 

AACTGGACAGATGAAACTAGAACAGACTCTTGGAGACGGCTGGCAGGAAGTGAAGCTCAC 

CTTACCTGGGCTTACCTCACTGGGTCAAATCAGAATTTTATTTTGGAGGGCAGGTTGGCT 

ACTTTGGATATTATCTGNGMTTTCCTGCATTGGCTGGACTTCTAATCTCTGNGAATTTA 

AAAGCC 

Sequence 1517 

CCGCGGTGGCGGCCGAGGTACCTTTTAGTTATTGACAAGGTTAAAATAGCACTCTCAGTT 

TTTCAGTATTACAGAGAGCAAATAGTTTCCTCTCCTGCTCTGNGCAGTAGCTTTTCCAGA 

ACTATGGACAAAATTNGATCAGAAAGAAGATTGATTATTTCCTCATCI Mill ICTTTTT 

TTTGAGACAGAGTCTCTCTNTGTCCCCCAGGCTGAAGTGCAGCGGCATGATCTCAGCTCA 

CTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTTCTGAGTAGCTGG 

GATTACAGGCGCCGTGCCANCATGCCCGC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I I I I I I I I I II I CTTTCTTTC I H I I I I II 

I I M I 1 1 I I I I I I I I I 1 1 I I I I I I 1 1 I I I I I I I I I I I I | I I I I | I GGCANNAAAAANCNT 

NATTNCCATTNGGCCCAAGGCTTGTTAGGAAAGTAAAAAAAGCCCCCNATTGGCNGGNGG 

GNNNGGCTTAGGCAAAACCCCTANTACTTTGCANGGGGCCCTCNAAAAAGCCCGNGGGCC 

CNA AAAAG NCCTTAAANCCNCCNTGNAANAGGGCCCCTTNAAAAAAAAACCCCCCNNNNN 

NMATTTTGNGGCCCCCCCCCGGGGGGGGGGGGGGGGGGGGGGNGGCCCCCCCCCCCNTT 

TNTNISIAACCNCCCNCCCCCCCCCCNNAAANGGGGGGGCCCCCCCNAAAAAAAAAANGGGG 

GGGG GGGNNNGGCCCCCCCCCCCCCCCNAAMMAAAAAAANGGGGGGGNGGGGNTTTTT 

TTTTTNCNCCCCCCCCNCCNNNCNNNNNNNNNGGGGGGG 

Sequence 1519 

CCGGGCNTGGAC I I I I II I 1 1 I 1 1 I 1 1 I I 1 1 I I NCTTGCTTTTGG I I I I I I CCTTCGATA 

TCTTCAAAATCTGTGTCAGAAGAAAAATGTGTTTCTGACTCCTGTAGTAGATTAAAATCA 

GTTGGTATTTCTGGAGCTGCTATCATTTCTTTATCATCTTCTGGAACACCTCAATGTCAG 

AAATATATTCATATATTTACGTGGGTAAMTGGNGGCTTGCCTTCTTTCTTCTAAGACAA 

TTCCTCACGGNGGGCCAAGCGCCCTCCGGGTGCCCCTCCAAACTTNTAGAAGCTTCGCCT 

CCGCCATTTTATAACTTACCTCCCCGCGTACCTN 

Sequence 1520 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAG 

GACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCG 

CTCAACCAACTCTTCCGAAACAGC 

Sequence 1521 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 
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CACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGT 

Sequence 1522 

TACT ATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I 1 1 1 I I 1 1 I 

TT7TGGTGGTMTTGAGCTGGGAMMTTCAGAATTGGGTCATAATTAATGGTAACTAAA 

CAGATTTGNGAATATGGGACATCTGTGGNCTTGAAAACATCAGTATGATTTGNCCCCATA 

TTTCTTCAGCCTGGACAATAGAAACAGACAGGGGAGGGGGGTAAAGTGCANTAAAGTAGG 

CTGAGTGATGTGGTGCTAGCAGCTGGAGTCCAGAGAAGTTCTGACAGTGCAGGGAGCAGC 

CCCTTTGTTCTTTGGAGCACTGGAAGGGCTGAGCTGCATCTGAGGTGTTCAAGCCACCAA 

CAGGACAGGGTAGAGGACTAAGTAGCACATGTCCTCCAGAGCAGCTTCCTGTCTTTGTGT 

GGTCACATCACATCGGGGGAA 

Sequence 1523 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI IIIHl l lll 



I I I I 1 1 I I I I IGCTTCTGGTAAAAAGAAGCATGTTACAATTTTCCCCCCCAATTCACATA 

GGACTCTAAGMCATATTTTAAATCAGTGCTTCCATACAGGMCGAMTCCACTATTTTA 

GAATTCTAAATCTTGTTGAAAAGCAACCTTATCTGAAGAGTAAACAAGAAGATTCAAAGT 

TAAGTATCAGTGCAGTCCAGAGCCCCTAAATGAATAAAACTGAATGTATCTTAAAAATAG 

GATTTGCACACCAGTAAGAGACTTGTTTACGATTCTGGGGAGGAGGGAAGAAACTGTAAG 

AGGGAGAGAAAAGGGAAGAAAAACAAGAAGAAAAATAATNGGAAAAAAAAAAAAGAAAGA 

AAGGTTTGTTTAGCTCAAAG 

Sequence 1524 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTGAGCTTAG 

TGATATTAACTTGGTAAATTAAGCCAGTTTGTCTCTTTGCTTTATCTGTCTTGTGGAATC 

TGTATCAGTGGAAATTAAATAGTCTTTACATTTGATGAACCCTGTTTAGGTGTTGGAAAT 

ACCCATCTATTTGTTAAAAAGGCAAGGTCCCATGATTTAGTGAATGGGGGATACAGACAG 

CCTTTATTCAAGTAACTGAATAAACAAAAGAATTAGAGAGTGTGATGAGTTTGAATAAAA 

AATATAGNTCATAAAAACCAGAAATGTGATAGAGCATAGTGGCTGGAAGAAAGTTACCCA 

AGTGGCTTGGGTAGTCAATGAAGTTGACTCCAACATGCAGTAGTACGCCGGGTCCTAAGA 

TAGAGATTAAGTCATGGTTTAAATGAGGAACAATCAGTAA 

Sequence 1525 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACACCACTGTGCCTGCT1TGAATCCTTTACGAA 

GAGAAAAAAAAATTAAAGAAAGCCTTTAGATTTATCCAATGTTTACTACTGGGATTGCTT 

AAAGTGAGGCCCCTCCAACACCAGGGGGTTAATTCCTGTGATTGTGAAAGGGGCTACTTC 

CAAGGCATCTTCATGCAGGCAGCCCTTTG 

Sequence 1526 

GAGCNCCCCGCGGTGGCGGCCGNTGTAC l I I I I I I I I I l I II I I I I GCTTNTTAATTGGA 

TGCCTGGAGACMTTCCATTTCAATTACCTTATTGGCATGACNAGATATACAAGGGCTGC 

CAATGTCAATACATTAAGACTGAGCGTGCTGGAGCAGCAGCCAGGGTTCAGGGCACTGCT 

GTGTCATCTGCGCCACGGTGCACAAAGGCAGNTTCAAAAGCATTTCAGCATGATCGCTTC 

CCTCTCTCCGCTCCTGGGGAGAGAAGGATCCTGCACACCACAGGCAAATCATGCTGAAAT 

TGAGGNGGNGCCTTTGGGACTCCCATCCCATCACAGTCTTGGGAT 

Sequence 1527 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATTGGTTGAACA 
AACAGGTGGGCAAAGTGAGGAAGATAAGAAGTCCATCCGTTCAGTTTCCCCACTGCGGAG 
GGAATAACACTGTCTTTCCACAGGTCACAGACTGGGATGAGCAACGGGCTGAAGGCACGT 
TTCCCGGGAAGATCTGAACTGGCTGCATCTCCCTTTCCTCTGTCCTCCATCCTTCTCCCA 
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AGATGGTGAAGGGGGACCTGGTNCTG 
Sequence 1528 

AGGTACGCGGGATGGCACATANGACATCAGCTAGGCTTTTGGGMTCGTTTGTGTTCTTT 

GTGGAAATGTCCTTTAGAAGCACCCATGAAGTAGTGTGTTCAGACTGTGCACACAGAAAA 

CAGGCTCTGCCTTCACATGTGAGACGGTGGACTTTTCCTNTGGACAAAATGACAGCATNC 

TGGCGACTCCACAGTGGAGCTGAGCGCCACTCCCTGTAGCCCGATCTGGGACTGAAACGC 

TTACACCTCTGCCTTAGAAGGAGTCCCCCNTGCC 

Sequence 1529 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACGCGGGGGAATTAGTCCGAGTGGAGAGAGCGA 

GCTGAGTGGTTGTGTGGTCGCGTCTCGGAAACCGGGTAGGCGCTTGCAGCATGGCTGACC 

MCCTGACTGAMGAGCAGATTTGCAGMTTTCAAAGAAAGCTTTTTTTCACTATTT^ 

CAAAAGATGGGTGGATGNGAACTATTAACCAACCAAAGGGAATTTGGGGAACCTGTAATG 

AGGATCTTTTTGGGCCAGMTCCCCACMGAAAGCAAGMGTTTACCAGGGAACATTGAT 

TTMTGGAAAGTAAGAATGCCTTGATTGGTTAAATTGGCACCMTTTGGACTTTCCCTTG 

MTmCTTGACAAATGGATGGGCCAAGGAAAAAATTGGAAANGACCCCAGNACCNGTTG 

GAAGGAAAGGAAAA 

Sequence 1530 

CCGC GGTGGCGGCCGAGGTACGC GGGTGTTCCTTTTTGTTCAMGTCTATTTTTATTCCT 
TGATATTTTTC I I I I I I I I I I 1 1 I I I I GNGGATGGGGACTTGTGAATTTTTCTAAAGGTG 
CTAT TTAAC ATGGGAGGAGAGCGTGTGCGGNTNCANCCCAACCCGCTGCTNACTTTTCAN 
CCNNTTTTCACCNGNCTNTGGGTTTTTMNACCCTNGGNTTTNTACCCCNNTCCTTT 
AANCCCTNNNCCNAAN NGNG GGGCAAANANCCNGNGGGCCCNCTANAAAANANCTTGCGG 
CCCCTGTCCCCCGGGGTTTTGAGGACAANTTTNCCCAAAGCNNAAAAANAGANGTTTTTT 
CNCCNTCNNCTCCTGGNGGGGGGGGGGNGAAAAAAAAAAAAAAAAAN I I 1 1 I I I I ICCCN 
GNNNNGGGGNGGNNNCTTNAAAAAAAAAAAANAATGTCCCCCCCCCCCGNNNNGTGGGCG 
NNAAl I I I I NTTAANAATATTTTTNNCCCCCCCCCCCCCCCCCCGGGGGGGG 
Sequence 1531 

ATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCATTCCGGGTCATCCGCAG 

AAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACAG 

GGAGTCCACCCCGGTGTGGTGCCTGTTGGGGACAGACCTGAATGTTGAAACTTGACAGTC 

AGAAAMTAACTCTTGATGCTGCTGTTTCGGMGAGTTGGTTGAGCGCATCCTCAATATT 

CCTTTTGTTCCTCTGGTAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGTAG 

GTTGGTGATGGTGAAATTCAGGTAGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTTGATA 

AACTGATGACTCCATTTCTGTCACATGGATGTCCACCAACTGGTAGGTGGAGCCCAGCCA 

ATGGGAATGAGGCATTCAGGGTCTTATCTAGAAAGACTTGCTCCACCAGGCTGGGGTCCA 

AATTGGAG 

Sequence 1532 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCACGCACGGA 
AGAAGACGGACCCCGCGATGAGGGCGGCGGCAAGGAGCACCTTCATGTTGCGTTCGGAGA 
GGCG CA GCATC CACAGGCCCCGCGTACAGACTGGTTTGGTAAATGCTAAACTTTTGTGTC 
TTTTGCCTTTTTAAAG G AATTGTTAACATTGG AATTG AGGGTATGTACCT 
Sequence 1533 

AGGTTTTGGGCGATCGTTTTCATACGAMTATTTGAGATGCTTTAGATGTGTGTGCATGT 

CAGCTGCCACCTGAAAGAMGGCCTCATTAMGATTTTCACTGATTMCTCTTTGATTGT 

TCTTGGGATCTCAGATGGGAATTCACGCTGCTTGCTGCAGAGCTCTTGGGCTAAGTGTAT 

TTTCT TAATT ACTGAGAAATGCGTGTTATCAGTAAGCAGTGAAAAGTCTTGAAAAAACTA 

AGTAATTTTAAAAAAAAAAAAAAAAAAAAAAANGTCCTGCCCG 

Sequence 1534 

NCGACTNCTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI M I NIMI 
H I I M I M M M I I I GGNGGNNGTTTTCATTTTCTTCTGGACTGGTAATTTTTACTTCT 
GGCCTTGGTGGTCTTTGAGGGTCTACATCAGCCTNATCTTGTCCACTGCCAGCTCCTTGA 
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GACTTTCTTTCCTTGTCTAGGACCAGTTGAAGCTCATTATTAACCTGCTGCAGCTCCTCA 
CCATTCTCCTCATCGGNGCCTCCATTTACACGAGCAGGGCGTGCCTGCTGCTCACTGTGA 
TCCAAAATCTCCTNGACTTGCTCAGGGGCATNTGTAGCCTCC 
Sequence 1535 

CACTTA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGC(X)GGGCAGGTACTTTTTGGTG 

TTTTTTGGAAMGTTACATTTAGATCTATTCTGAAGCTGTTCATTTTTM 

GTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGTCTGTGA 

ACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAGTCAGAA 

AAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGTGCACGT 

AGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGTGAGAAT 

TTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCACTTATG 

CMGACCCCAACAACTGGCCCTTGAMGGAGCTTTTTACTGGTGGGATGTG 

Sequence 1536 

AG GTAC TAATATCCCTTAATGGCAGAATGTGATAATCATGGAATTAATTATTGCTAAAGT 

AGTTTTCAAATAAAAAAAAGAAAAAAGAAAACAAAACAAATTAAACTTGACACAATCTGA 

CCAMCACGTGTCAATTTACMTTTCMGGTTATTTTACAAAATACCTACATTTACACAA 

TAGGCTCCCGGCAG CATTT CCAGACAMTGTTCTTTTAATTTATCCTGACATGCTATAAA 

TGAATAMTTACACTATTTTAAAAATTATCATCAGTMGTTTTTCCTTTCTCATGGGGGT 

CAANAGCAAAAAAGAAAATNNAGGCNTGCCAAAGGAAGGATTTGGAGAGGGGAAAGCCGC 

CACGCACCACCAGTATAACCTTAAAATAAACAGGGAGGGGGCTGGGGGGAGGGGGTCAAC 

AAATTCACTTGAAACCCTCACAACAAGTGGTGGATGCCTCTTCCAAAGGGGGGNGGAAAA 

GGTTCMNCTTGGCTTTTATTAGGCCCANTTTCAAAAAMGGTATTTTTAAMGNAAA^ 

ATGGTTTTTGGTANGGGGAAAAA 

Sequence 1537 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAAACACCTCATGA 

GGCTGGCCAGAGATGACACCACTGCCCCGGATGCAAGGGGGCAGCCTTGCTTTGGACCCC 

GTTCTGAATCTCAGGCTTCATCTATTAGGCACACTGAGCAGAAACACTCCTCCCTGCCAC 

T TCCG AGTTGCAGCAAAAAGGGTCTMCTATGATTTAMGCAAATMTTTTAAACTTCAA 

ATTTTCATTACTGTCATACTTATGAGGACCAAATAATGTAATGACTAAGACATCAAATTA 

CATGTAATATGAATAAAACCAGTAAGAAAATGATAGTTACTTATAATTGGATCAACATAC 

AAAMGAACGTCATTTGGCCAAAGTAAAACGGTTAAAAAAATAGTGTCTATAAAATCACC 

ATATAAAATCTTAGTATGTACCTGCCCG 

Sequence 1538 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAG 

TTGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGT 

GTGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCA 

CTGGACCTAAGCA GGGC CAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAA 

TGCAMGTTATTTATTTTGTTAGAGTTTGAACAACNGTGATCTGTAGAAAACCAGTCACA 

TTTCCTTMGGATGCATNMTAMTATGAGCTAAATGTMTMTTAGAACTTTGGTTTCC 

AAGCAGATT CCATG CTCTAATTCCCTGGATAGGTGAATATAATAAGATGATNTATCCNTG 

MTMGNTNCTTTTNCTTTGCCCAAGGGGACANGATGGTAAATAAGGGCTACTAAATCAA 

GTTG N CCTTT AAAAT AA 

Sequence 1539 

CCGCGGTGGCGGCCGCCCGGGCAGGTACMAACAMTTCAAAGGGTTAAGAATTTCAAGT 

TGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGTG 

TGACTGCACTGGTCACATTGCCATGMGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCAC 

TGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAAT 

GCAAAGT 

Sequence 1540 

GTACTTAG GGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGGCCCTTCCTT 
TGAAATCATTTTTTCTTCCAGGCCCTTGCGCTCCGGGCCTGGGAGACAGAATGAGAGACA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 
254/467 



GCCCC AAGGA TCTGTGATACACTTTCAGGGTCA I 1)11 1 CCTTGTCCCTGCCAGTCCAGG 

CAC ACCTTT TGTGTTCTCTCCTGMCATACTTTCTCATTTTTCACAATATGGAAAGGCTG 

ATAATTTTTCAAACCTTTAAGTTCTC^ 

TTTTTCTTCTCACATTTTATTATTATAMCATTCMGGGAAACCAAGC 
TAGAMTTTATTCAGCCAGCTGGGGCGGTGGTGGCTCACACCTGTAATCCCAGCACTTTG 
GGAGGCCCGNGGTGGGGTAAGATCACCCTGAGGTCAAGGAGTTTCGAGACCAGCCTGACC 
AATAATN G GTG AAAC ACTTG N CTTNT ACTT AAAAAAT ACC AAAATTT 
Sequence 1541 

GGAGCTAMCCCCGGNGGCGGCCGAGGTACTTACCCTCMTTTCAATGTTAACAATTTCT 

TTAAAAAGGCAAAAGACACAAAAGNTTAGCATTTACCAAACCAGTCTGTACGCGGGGCCT 

GTGGATGCTGCGCCTCTCCGAACGCAGCATGAAGGTGCTCCTTGCCGCCGCCCTCATCGC 

GGGGTCCGNCTTCTTCCTGCTGCTGCCGGGACCTTCT 

Sequence 1542 

CCGCGGTGGCG GCCG AGGTACACAAACGAGATGCTACCTAGGAGAAGGGTATTCTTTTCA 

CTATTCTTTCAAATTTTCTGTATGTTCGMCATTTTCATAGTAGAAAGTTGGGGGGAAAA 

TCTGTTT CATAA ACATTTCCTCAGCAGCAGTCCAGTCTATTGCATTTTAATTGGTTGTGA 

TATCATTGTTTTATGCAATACGTTCTCAACAAGTATATCCTCCGGCAAACTGAACAAGGA 

CCMGTCTGTTCTGCCTACAGCTCTGCTTCCTCATAGCTGCTTTCCAGAACGTGACTCTT 

GCAAATTATCAAGAAAGGGGAACTAATCTAAGGGATCCCAGATCAACAGCCTNATGAAGA 

CCTTAATTTATGNTTCTAANATAAAAGATAGGMGTTTTCAAAAAMGCCCCTGCTTCAC 

ANAGGATCAANANCAGGGGGTGGGCCCTGCTGGGCTTNCACTGGGATTTTTTGAGCATTN 

CTTTCCCNGGNGGCNCGGNAAGGGGNGTGGNGTGAGCCCNAGGGNGGAAAAAAATTT 

Sequence 1543 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGMATATTTCTTTCATTTGCTC 

TGAGCTTMTATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTCA 

CGTGTATTCAC^GCCTCAATGCCATAAGGAAACTCTTT 

Sequence 1544 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACCGACTTTGTCCAGCTGCAG 

GTGCCTGCAGGTCACACTGGCTAACTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTG 

CCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAMTATTTCTTTCATTTGCT 

CTGAGCTTAATATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTT 

TATCMGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTC 

ACGTGTATTCACAGCCTCAATGCCATAAGGAMCTCTTTTAAGAAGTTCTGACAGCTGGT 

CATGTTAGGTATMGMCAGGGTGCCTTATCACTGGTGGATTTCATTTCT 

Sequence 1545 

CGAGGTAC I I I I I I I I I I I I I I I 1 1 1 I I I GGGG AGTTTGTTGTTTTTAACCAAATTATNA 

TAGATGGAAGCATTAGGCAGCTGMTGTTCATTTGCCTTCANACATCATNTCCTATTTCA 

mGCTNGNCTCGCATTAAAAAGMTCAmATCAGCMAAGCATCATTTAmGTTTAA 

ATGA CMGGTTTAGCTAGCAGAGNAGAGTrTGCNATGCTTTTAAANAAATAACNTTTGAC f 

TTTTCTTCMGACACTACAAMCCATTTGTTCAAAAAAGGCTGCCCAANGTCATTTANAA 

GMTATTTTTCAMANGTCTAmCNTATTTTTAAAAAACTTGCCCT^ 

GGTTTTGAATTCCACCTGGGCCTTTTCTTTTTMCCTTGAMGGGCTTAAA 

Sequence 1546 

GGCGGCGGCCGAGGNACAAGNANCNNTTGGNGGAGGGGGGNGAAACCCAANACCCGAACN 

NGGGACTGNGCAGACAAGCTATATCTTAANCCCNCNCCGGGCCAGACCNCNAGCAAGGGN 

GAGGAAGCAAAAGNCCACAGNNACNGGGGCAGGNAANNGGNANAAANGAGGGNGNGGGGC 
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NGGNTGCGGTGNTTNACAGGGGGAANCCCAACACCCCGGGAGGNCGAGGCAGGAGGANCG 

CCCGAGCCCAGGAGGNNGAGACCAGCCNGGGCAACANGGNGAAACCCNGGCNCNACAAAA 

AANACAAAANNAGCNGGGCANGGNGGNNNGGGCCAGNGANCCCAGCNANNNGGGGAGGCN 

GAGGNGGGAGAAANGCNNGAGCCCAGGAGGCAGAGGNNGCAAGNGAGCCCAAGAANGCGC 

CACCGGAACNCCAGCCCAGGCAACAGAANGAGAACCCGGNCNCACANNAAAAAAAAAAAG 

AAAANGAAAAAGAAAAAANGNCCCNGGCCGGGCCGGGCCGGNNCNAGAACCANGGGGAAC 

CCCCCGGGCNGCAGGAANNCCGAAANCAAAGCCAACNGANACCGGNCAACCCCGNGGGGG 

GGGCCCNGGNACCCANNNNNNNGGNNCCCNNNAGGGGAGGGGNAAANGGCNCGCCNGGNC 

GNAACAAGGGGCAAAAGCNGGNNNCCNGGGGGGAAANNGGNANNCCNGNCNNCAANNCCC 

NNNAAAAAACNAANCCGGAAGCAAAAGGGNAANNCCGGGGGG 

Sequence 1547 

AACACCACCGCGGNGGCGGCCGNCCGGGCAGGNACTTTNNTTNGGGGGNTAAAAAACCCC 

CCNAAAAACNGGGCCCTAGNAGAAGGCAACNTTCATTNCAMCGAGGGGGCCCCNGCCCC 

GNGAAAAAACGGGGAACACGACGNCNAAGGCAGANCCCCGNAAGNACCTACNNNGGACAG 

CCGGGGCAGGCGNGCAAANNNTTTGGGCNNGGCCNCGCAAAGCACAAAGGGGGACACANA 

ACCCACTGCCCACGGCGCAGGAGAAAAAAAAGAAACAGCAAANCACGAGGGGACAN 

Sequence 1548 

CCGGGCAGGTACATCANTTNTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAA 

AGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTC 

CGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCA 

AGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTT 

GGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAG 

CACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGC 

CCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCGGTGAGAAG 

GGGGTGCTCAACCMCTCTTCCGAMCAGCAGCATCAAGAGTTATTTTTCTGACTGTCAA 

GTTTCAACATTCAGGTCTGTCCCCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAAC 

TTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATTCTATGAGGAATTTCTGCGGAT 

GACCCGGAATGGTACCTTNGGCCGNTTCTAGAACTAGGNGGATCCCCCGGG 

Sequence 1549 

CCGCGGTGGCGGCCGAGGTACGCGGGGCTTGCATCTCTGGGGCCAAGGAGTGGTGGGTGA 

GATCTTCCATGGCCCTGGCATGGGTGATATAAGCGGGACCGGTAAGGTGGTGGAGCTCTT 

ACCAGACCCTGCANAACCCTCTCCGTGGTGTTGAACTTCCTGGAACCAGGGTGTTGCATG 

TTTTCCTCATAATGCAGGTTGGTGATGGTGAAGTTGAGGGTGAACGGCACCAGGAGAGGG 

CCAGCAGTTGTGGGGCTGGGGAGGGAGGATGGAGTCCCTGACCCAAGGTCCACTGTGGAG 

GTCCCAGGAGCTGAAAAAAGT 

Sequence 1550 

AGGTACTTTACAAACAAGTCTGAAAAAGGAGGGAGTAAAGTATGGAAGAATGATCTCTGG 

ATGTTGCTACTGGCCTCAAAAAAGCAGTGCTACAGATTTCTGTGTGAAGAGAATACGCTG 

TTCACACATTTTCCTATTTCCAGGCATGAAAATATTCTATTGGGTAGAAGAAATAGGAAA 

ATCTCTTATGACAAATGAAAGACAGGTGCAAACACACCAATCCCTGTCTAGCAGTATAAA 

GCATATTGGGCTCAGAATTTGTCTGTTGCTAGCACCTGGCTTTCATACTATATCCTTATC 

AAATAATCAGATTGAAAGTCCAAATCATTCTTAAGCAAGCAAAAATCCTCAGTGGCCATA 

CCTCA 

Sequence 1551 

GGCGGCCGCTCTAGAACTAGNGGANCCNTTTTGCGGGGGGAAAAAAACCCCAAGCCCANC 

GANACCGNCGACCNCGAGGGGGNNCCCGGNACCCAGCGNNNGCCCCCNAAAGAGAGGGNN 

AANNGCGCGCNNGGCGNAANCANGGNCANAGCNGNNNCCNGNGNGAAANNGNNANCCGCN 

CACNTTTTNNTTTNNCNGACGAGCCGGGNGCANAACCCCCAAANANA 

Sequence 1552 

CCGGGCAGGTACGCGGGCTGCCTGGGGATGGCAGCCGCGTGCGTTCCCTGTGCTGTCCTG 
TAGGGTTGTGGTGGCTTTCTGATATTCAGGTTGGTCCCTTTTCCTT Cl I I I I GGATGGGC 
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TTTCGAGGTTGGGAGAGGAAGCTCAGGGATGGAACTGACTAAACATATGTGAGATGTTTC 

CTTGTTTGCTGCTCATGGAGGTTTCATTTCTGMGTCTTTTCTTGGGAGGGAAAAGGATG 

TGTGGATATATGAGGTGACTCTAGAACCCTCATTTTATGGATGAAGAAACTGTGATTCAC 

AGAGGGAGAGTGATTTGTCCAGTGTCACGTGGGGAGCTGGCCGGGGAGCCAGACCTCCTG 

TACCTCGGCC 

Sequence 1553 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTCCATG 

GGCCCTGTTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCA 

GTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGC 

TGTGGTGAAAGCTGTAMACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAG 

TCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAA 

AATGMTTTAAAMGTAAAAAGGGCTCTGTGGTGTMTTTTTCTTAACTACAAGAGTCTA 

AATACACTGCTTTTCTTTMGAGTTCATTTTMTTAGTMCGTCAMCAAAATTATTCTA 

GATAATGAGCCCTACAAA 

Sequence 1554 

ATCGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 
GGGTGGCCAGATTMTTTTCCAGCCMTAGAGGAMGTTTTTATAT AI Mil I GCTAAGG 
TGCTGGAGAGGGGAAAGGATGCCACACTTCAGAAGCAGGAGGACGTTGCTGTGGCTGCTG 
CAGTCTTGGAGCCCCTGCTCAAGCTGGCCCTGCTGGCCGGCCTGACCATCACTGI I I I IG 
GCTTTGCCTATTCTCAGCTGGCTCTGGATATCTACGGAGGGACCATGCTTAGCTCAGGAT 
CCGGTCCTGTTTTGCTGCGTTCCTACTGTCTCTATGTTCTCCTGCTTGCCATCAATGGAG 
TGACAGAAGTGTTTCACATTTGCTGCCATGAGCAAAGAGGAGGTCGACAGGTACCT 
Sequence 1555 

CCGCGGTGGCCCGCCCGGGCAGGTACAGCAAAAAAGAAACTGAGAAGCCCAAACTGCTTT 

CTTGTTAACATCCACTTATCCAACCAATGTGGAAACTTCTTATACTTGGTTCCATTATGA 

AGTTGGACAATTGCTGCTATCACACCTGGCAGGTAAACCAATGCCAAGAGAGTGATGGAA 

ACCATTGGCAAGACTTTGTTGATGACCAGGATTGGMTTTTATAAAAATATTGTTGATGG 

GAAGTTGCTAAAGGGTGAATTACTTCCCTCAGAAGAGTGTAAAGAAAAGTCAGAGATGCT 

ATMTAGCAGGTATTTTAATTGGCAAGTGCCACTGTGGAAAGAGTTCCTGTGTGTGCTGA 

AGTTCTGAAGGACAGTCAAATTCATCAGCATGGGGCTGTTTGGTGCAAATGCAAAAGCAC 

AGGTCTTTTTAG 

Sequence 1556 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCCACATCCCGAAGGCCATAGCGTC 

GAGCAAGGTCACATGGATGGCAGCACCTTACCACTCAGGCTCAGGATATTGGGATCTGTT 

GCCMAGCCACCACACATTTGCCACTCAATTCTGTGGTTTCCGCAGATGAGAAGGCTGAT 

TTGMCTGCTTCMCACAGGATCCTGCAGGACCTCCTCCTTTGCCATATGCTCCTTCAGC 

AGTTCTGTCTGCACAATCCCCGGCCACAGAGACACACAGCTGACCCCATGGCGCCGCAGC 

TCGTGGGCACAGTCAGCAGCCAGCTTGTCACACGCAGCTTTGCCCACACCATAGGGGACA 

TTGAACATATACTGCAGGCTTCCTGGGGAGGAGATGACCACGATNAGCCCCTGGCCAGCT 

GGTACCT 

Sequence 1557 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCAGGTACGCGGGGGGT 

CCTGGTGCTGCGGCCACACCACCCTGGGGTGCTCATTGACAGAGCTGCCATAATGAACTT 

GAAAGGACGGGAATCACCGAGGGAAGCTGGGGCTCCCCTGCCCACAGGAGAGGATCCCCG 

TTCTTCAAGCTTCTCTGCTCAGTGTCTACTAACGACCGACATTTGCTAATGTAAATAATA 

GTAAATTATTGAGAATTCTAATTCTTTTACACAGTCTG I 1 I I I AATCTATTTTAATTAAA 

TAAAATCTATGACT 

Sequence 1558 

ccgcggtggcggccgggggccattgAgactgccatggaagacttgaaaggtcacgtagct 

gagacttctggagagaccattcaaggcttctggctcttgacaaagatagaccactggaac 

caatgaggaaggagagaattctactggtcacagacaagactctcttgatctgcaaatacg 
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ACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCA 

TCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCC 

TTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCCCTTCTGTCCCGCTGGAACCCATGGT 

CCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAAT 

TCCTTGAAATTTGCAAGTTGTCTGGGTTCATG 

Sequence 1559 

TACT T AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCAGGCACAGCTAA 

TTTTACTTTTTATTTTCTTGAGACAGGTCTCACTCTGTTGCCCAGGCTGGTCTTGAACTC 

TTGGCCTCAAGTGATCCTCCAGCCATGAGCCACTGTAAATGTCTGGGAATGCCAACTTGA 

AGCCCAGCTGGTCAGAAGTTCTAGAGGCCCAGACTTGCAACTGGTGTCTGAAGGGATGGC 

AGTCTTGGGGACTGAGCCCTCAACCTGAGGGATCTGATGCTATTTCCAGGCAGATAGTGT 

CAGMTTAAATTGGAGGATACCGCAGTTAGTGTCTGCTGCAGAACTGATTGCTTGCTTGC 

TGGTAGGGGGGAAATCCCCACATATTGGGGGGGTATTTGMGTCTTTTGGTGTTGACTCT 

TACTGNGTTTNGTTGGTTTTTCATGTGAAGAGCAGAGGGAAAAGCAAGGGA 

Sequence 1560 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGACCCAACAACTGTGACTA 

GCTGGCCACGTCATTCAGGGCTGGTGTGGCATTTATGTGTGTGTGTGTGTGTGTGTGTTT 

TTCCTGTTTGCCCAGCAGTGCATTGTGGGTTCCAAGAGTGGGTAGTGTGTGTATGTGTGT 

GTGTCAGAGGGAGACCTGGCAGGCACCTNTTTGAGAGTAGCTGTGGTCAGAGCTGTTTGG 

TCAGTGCATTATGTTGAATGAGGTCCAGGAACCCAGAGCCACCCAGCAGACACCACTGTG 

GCTTGCCAGCTGCCAAGATGGAGAAGCATGTGCCCCTGTAGAGCGTCTTCCCAGAACCAG 

ACCCC GAGCCACTCGCTTCCTCTGTGCTGNGACAACATTGGTGCCAGGGGAGATNGTTNT 

TTTTTCAAAGGGGACCTACTGTAGCCCCTTTTAAT 

Sequence 1561 

CCACTCACTATAGGGCTGAATTGGAGCTCCCCGCGGTGGCGGCCGGAAGAGCAACCGAGA 

TGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACT 

TACAGAGAGTTCCAAGMACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACTGAGAA 

TATGTMGAGCTTTAMTGCTACCAMCTGGMCGAGTATTTGCAAAACCATTCCTTGCT 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCTGGCT 

ACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAAT 

TGTATCCGTTACCT 

Sequence 1562 

GGGCGNGTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCCTGCCTTCCCCCATATA 

CTGAAGTTTGAGAGGCTGGGAAGGTGCGGAATGGGAAAAGGAGCAGCTGCTTATGTTCAG 

TTTMCATTCTCTGGGTTTCTCCATCTAGGTCTTGAGTTCATTCTCTTCCTGTCCTTTTG 

GCTTCCTTGTTTAACCTGGTCCCTGTTTCAGGAGAGAAGCCTCATCAGTGCCAAGTCTGT 

GGGGAAGACCTTCTCTCAGAGTGGAAGCAGGAATGTGCATATGAGAAAGCATCACCTGCA 

GCTGGGAGCAGCTTGGGAGTCAAGAGCAGGAGCAAACTGCTGAGCCACTAATGGGGCAGT 

AGTTTGCTTG 

Sequence 1563 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGC 

CAGCCCAATGCACAGCAGTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGA 

CTTTGTCCAGCTGCAGGTGCCTGCAGGTCACACTGGCTAACTACTTTGTGATGGGCTCTT 

CTTTCTGAGGTTCTGCCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATA 

TTTCTTTCATTTGCTCTGAGCTTMTATTGTMTTTGTA7TTGATCTGCTGGGTCTTTGG 

AGTCAGGACTGGTTTTATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGC 

CCACAGAACCTTCACGTGTATTCACAGCCTCAATGCCATMGGAMCTCTTTTAGAAGTT 

CTGACAGCTGGTCATGTAGGTATAAGACAGGTGCCTTATCACTGTGGATTTCATTTCTTG 

Sequence 1564 

CCGCGGTGGCGGCCGAGGTACAMTTGTCGTTTTTATTCCTCTTATTGGGATATCATTTT 
MAAACTTTATTGGGTTTn*ATTGTTGTTGTTTGATCCCTMCCCTACAAAGAGCCTT^ 
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TATTCCCCTCGCTGTTGGAGCAAACCTTATACCTTACTTCCAGCAAGCAAAGTGCTTTGA 

CTTCTTGCTTCAGTCATCAGCCAGCMGAGGGMCAAMCTGTTCTTTTGCATTTTGCCG 

CTGAGATATGGCATTGCACTGCTTATATGCCAAGCTMTTTATAGCAAGATATTGATCAA 

ATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCCA 

ACTGCTMTGCAAATTAAMCATATTTCATTTTMCATGATTTCAAAATCAGTTTTTCAT 

ACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGAA 

TGGGGTAGAAACATTGTAAATATTTACTCTTTGCAAACCCTGGTGGGTATTTTATTTTGG 

CTTCATTTCAATCATTGAAGGTATATTCTTATTGGAAATGTACCTGCCCNGGCCGGCCGC 

TCTAGAACTAGTGGAATCCCCCNGGC 

Sequence 1565 

ACTACT ATAGGG GCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT 
TTGGTGTTTTTTGGAAAAGTTACATTTAGATCTATTCT 

TAAMTGTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGT 

CTGTGAACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAG 

TCAGAAAAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGT 

GCACGTAGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGT 

GAGMTTTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCA 

CTTATGCAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTCACTGGTGGGATGTGGCC 

CTGCTTGTATTCAGAACCATAGTTTTMCMGCCANCATTAMTCCACAAGTCTTTGCCA 

MGCACTTTAAGCCTNTTGACATTTATTGGMTTAATTTACCTGCAAGGAAAGTTCATAT 

ATCTAGCTTTGGTAACCCTACATTCGGGAAAAATGTTTCCATGANATAACTAAAANCCCC 

ATGAATGATACAATCTTGGACAAAACCCAAAGNNGGCATAATTAGCATAAAACTCCAAAT 

Sequence 1566 

CCCGCGTCCGGCATCTCCCAACGTGACTGACCCGCCGACCACGACCCGCAAAGTGGTCCC 

GACGACGCTCACCACCACCAAGCCGCCAGAAACCTGTGAGAGCTTCAACAGCTGTGTTTC 

CTGTGTCAACGCCACCTTGACTAATAATATTACCTGCGTCTGGCTAGATTGCCATGAAGC 

AAATAAGACCTATTGTTCAAGTGAATTAGTAAGTAATTGTACCCAGAAGACCAGTACTGA 

CTCCTGTTCTGTAATACCTACCACCCCAGTGCCAACCAATTCTACAGCTAAGCCTACAAC 

TCGGCCTTCCTCTCCTACACCTACTCCCTCAGTTGTCACATCAGCAGGTGCAACAAATAC 

CACTGTGACTCCAACCTCACAGCCTGAGCGGAAGTCCACCTTTGATGCAGCCAGCTTCAT 

TGGAGGAATCGTNCTTGTCTTGGGTGTGCAGGCTGTMTTTTCTTTCTCTATAAATTCTG 

CAAATCTAAAAGAACG 

Sequence 1567 

TCGCCNCGCGTCCGGGCAACTGCAGTTGGAAAAAAAGATTCAACTTCAAAGCAGAGGATT 

TTTGATGAAGAACCAGCTAATGGAGTGAAGATAGAAAGGTTTACAAGGGATGATCCTTGG 

TTATCTTCAT GTGAA GAAGTGGATGATTGTAAAGACCAGTTGGAGAAGCAACAGGAAAAA 

CMGAGATACTTTTGCAGGAAGTGGCATTCACTCAAAGGAAAGCAGTTATTCATGAGAGA 

GTCTGCMMGTGATGAMCTGGGGAGMGAGTGGTCTGMTTCCAGTCTATTTTCATCC 

CCAGTTATACCCATAAGAAACCATTTTCATAAACATGTATCACATGCTAAAAAATGGCAT 

CTTAATGCTGCTGTAAACAGTCATCAGAAGATTAATGAGAATGAGACACTATATGAAAAT 

MTGGMTGTGGAAMCCCCCTCAGAGCATTCACCTTATTCAGTTTACAAGAACCTCAAA 

CA 

Sequence 1568 

GCTCCATGCCCTTCTCTGAGACGGGGACCAGGGGATGGCAGNCATGCACCTGACAGCCTG 

GCCCNAGAAGTCGGTGACCTTTGAGGACGTGGCTGTGTACTTCACCCAGGCGGAATGGGA 

TGGCCTGTCCCCTGCACANAGGACCCTGTACAGGGATGNGATGCTGGAGAATTATGGGAA 

TGTGGCCTCCCTGGGATTTCCACTTCTCAAACCTGCTGTGATCTCACAACTGGAGGGAGG 

AAGTGAGCTGGG 

Sequence 1569 

CGCGTCCGTTTCTCCTGGCACCCTGTATTCATGGCCTTGGCGTTCTGCCTCTGCATGGCT 
GAAGCCATCCTACTCTTCTCACCTGAACACTCCCTGTTCTTCTTCTGCTCCCGAAAAGCA 
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CGGATCCGGCTCCACTGGGCAGGGCAGACCCTAGCCATCCTCTGTGCAGCTCTGGGCCTG 

GGCTTCATCATCTCCAGCAGGACCCGCAGTGAGCTGCCTCATCTGGTGTCCTGGCACAGC 

TGGGTGGGAGCCCTGACACTGCTGGCCACTGCTGTCCAGGCACTGTGTGGGCTCTGCCTC 

CTTTGTCCCCGGGCAGCCAGGGTCTCAAGGGTGGCTCGCCTCAAGCTCTACCATCTGACA 

TGTGGACTGGGTGGTCTACCTGATGGCTACAGTAACGGTGCTTCTGGGCATGTACTCAGT 

ATGGTTCCAGGCCCAGATCAAAGGTGCGGCCTGGTACCTGTGCCTG 

Sequence 1570 

CGTCCGCTAAGTTCCAATATTGAAAAATCTGTAAAAGACCTCCAGCGCTGCACAGTGTCT 

CTTGCACGGTATCGAGTTGTAGTTAAAGAAGAGATGGATGCCTCCATTAAGAAAATGAAA 

CMGCCTTTGCTGMTTGGAGAGCTGTTTAATGGATCGAGAAGTGGCGTTGCTTGCTGM 

ATGGACAAAGTGAAAGCTGAAGCAATGGAAATTTTGCTCAGCCGACAAAAGAAGGCTGAA 

CTTCTAAAGAAGATGACTCATGTGGCTGTTCAAATGTCAGAGCAGCAA 

Sequence 1571 

GCAGCCGGCCATGCAGGCCGTCATCCGAACTGACATGGACCAAATTATCACCTCCAAGGA 

GCACCTTGCCAGCAAGATCCGAGCCTTCATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAA 

CCATGTCCAGCCCTACATCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGG 

CTTCACTGAGGTGCGAGATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCAT 

CAACGAGGGCCGGCATTGACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTTGGCG 

TACCACCCCCTGAAGATGCAGAGCTGCTATGAGAAGATTGGAGTTCGCTGCGACTGGACG 

GGCTGCAGC 

Sequence 1572 

CCGMCAANGTGGCCACCCAGGTTTTTAACCCAAGTCTAGTGGTCATCCTATTCTTTCCA 

CACCAACATGCCAAAAGCCTTACCTNGAAAGAAAATATAATTTGCAAGAAGCATCACAGT 

GCCGGGGTCTATATTCTCGATCAGGTTGNTAATTTTCCCATGGG l I M I I G ACTGATAAA 

GNCATTGATCTGCTTCTGAGCCATTTCCAAATTCTGAAAGTTGGTAAGGATGGTTTCGGN 

ACTGTAAAAGTTCTTGGCATCTTCC 

Sequence 1 573 

CGCGTCCNGGNCGGAGAAGACAGTAGGGATACTGGATATGGGAGGAGCCTCTCTCCAAAT 

TGCTTATGAAGTTCCTACCTCAACCTCTGTCCTTCCTGCAAAGCAGGAAGAAGCTGCCAA 

GATCCTGCTGGCTG AGTTC AACCTGGGCTGTGATGTGCAACACACTGAACACGTGTACAG 

GGTTTATGTCACAACTTTTCTGGGTTTCGGAGGCAACTTTGCCCGGCAGCGCTACGAAGA 

CCTTGTTCTGAATGAAACTCTTAACAAAAACAGATTGCTTGGTCAGAAGACAGGTCTGAG 

TCCCGACAATCCATTTCTGGATCCCTGCCTGCCAGTGGGACTCACAGATGTGGT 

Sequence 1574 

CGCCGTCCNGTTTACTTGGAGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTT 

TCAAGANNNTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATGCACATG 

GCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGGAAAA 

AGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCNGTATGAAAGCT 

GGMCTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 

AAAGMGAGMTTTGATGCTGCTGTTMTTCTGGAGAACTGTGGTTTGTAMTTTTTAC 

Sequence 1 575 

GAGGCGCTCAACCTACCGAGGCGCCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCG 

CTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTG 

GCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACTGCG 

CCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTT 

CCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGC 

GCCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCMCGCCAT 

CCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTTCAAGTTTCTCGACATCG 

GATACAATCAAGC 

Sequence 1576 

GACCACGCGTCCGCGCACCGCTTCATTGAGGCTGCAAGAGCACACGGGCACCCACGTGCT 
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GGTCCACTGCAAGATGGGCGTCAGCCGCTCAGCGGCCACAGTGCTGGCCTATGCCATGAA 

GCAGTACGAATGCAGCCTGGAGCAGGCCCTGCGCCACGTGCAGGAGCTCCGGCCCATCGC 

CCGCCCCAACCCTGGCTTCCTGCGCCAGCTGCAGATCTACCAGGGCATCCTGACGGCCAG 

CCGCCAGAGCCATGTCTGGGAGCAGAAAGTGGGTGGGGGTCTCCCCAGAGGAGCACCCAG 

CCCCTGAAGTCTCTACACCATTCCCACCTCTTCCGCCAGAACCTGAGGG 

Sequence 1577 

CTACACTCAACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAG 

CTACATTCAACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGA 

GCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATG 

GGGCAGCCACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGC 

TGGACATACAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGTGTCACCCAACTGG 

GCTTCTATGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAGAATTTAT 

CAATCCGGGGCGAGTACCAGATAAATTTCCACATTGTCAACTGGAACCTCAGTAATCCAG 

ACCCCACATNCTCAGAGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACAC 

TTTTACAAAGGCAGTCAAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGAC 

GATGGACTTCCGTGTTGGTCACTGTCAANGCATTGGTCTTCTTCAATTTG 

Sequence 1578 

GCGGCCGCCGGGCAGGTACCTMCCTACCTTTAAGACTGGGATMCTATTGNNNTNCAAT 

AGNTTTATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGC 

TTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGA 

CGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1579 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATNTCT 

ATTGNTAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1580 

CTCCCCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGT 

AGGCCNTTNCCCTACCMCTMCTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGA 

AGCTCCTTTCTATTACTCATCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTC 

CAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1581 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTA 

TTGTTNAACAATNNCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 

Sequence 1582 

AGGTACAAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGNTAGGCCNTTACCCTACCAA 

CTAACTAATGTTCCGCACCCCCATTTTTAAGTGMGCTGTGMGCTCCTTTCTATTACTC 

ATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCT 

TAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1583 

CCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAMCATCATCGTCTTTTTNGGC 

CATTACCCTACCMCTAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCT 

CCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAAT 

AGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1584 

T CTTC GANCACGNTTCGGGCGGCTTTTCCCCGGGCAAGGCTTCTAAATCGGGGGGGCTTC 

CTTTTAGGGGGTTCCGAATTTAAGTGGCNTATAACGGGCANCCTTCGAACCCCCAAAAAA 

AACTTTG 

Sequence 1585 
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AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTNGGTAGGCCATTACCCTACCAAC 

TMCTMTGTTCCGCACCCCCAT11TTMGTGMGCTGTGMGCTCCTTTCTATTACTCA 

TCATGCGATAMTAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1586 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGCnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCCCAACGT 

GAACCGGTTACACTTTGCCMGCGCCCTANCGCCCGGTCCTTTTCGCTTTTCTTTCCTT^ 

CCTTTTCTTCGGCCACGGTTCNNCCCGGCTTTTTCCCCGTCMGGCTTNTAAATTCGGGG 

GGCTTTCCTTTTMGGGNTCCCGATTTMGTGGCTTTANCNGGNMCCTTNGAACCCCCA 

AAA 

Sequence 1587 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TMCTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1588 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1 589 

TACCNCGCGTCCGGGGCCCGGATGCTGGGGGCCACCAGGGCCCCGGGGATGTGCTGGTCT 

TCATGGATGCCCACTGCGAGTGCCACCCAGGTTGGCTGGAGCCCCTCCTCAGCAGAATAG 

CTGGTGACAGGAGCCGGGTGGTATCTCCGGTGATAGATGTGATTGACTGGAAGACTTTCC 

GGTATTACCCCTCGAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCATT 

GGGAACCTTTGCCGGAGCATGTGAGGAAGGCCCTCCAGTCCCCAATAAGCCCCATCAGGA 

GCCCTGTGGTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAG 

CGTATGACCCTCTTATGTCGCTGCGGGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCT 

GGCTCTGCGGTGGCTCCGTTGAAATCCTTCCCTGCTCTCGGGTAGGGCACATCTACCGAA 

ATCAGGATGCCCCGTCCCCGTTTGACCAGGAGGCCACCTTGAGGAACAAGGTTCGCATTG 

CTGAAGACCTGGCTTGGGGTCA 

Sequence 1590 

GGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATCCACATATTTTTGAG 

AAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACC 

ATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACA 

GCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGC 

CTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTC 

TGGTTCTTCAGCCTTGCTCCTTC 

Sequence 1591 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTGMTATCMTTTCTAAATATTTACCC 
AAAAATGTATATTTAAAG I I I I I AACAACTTCTTTTCAATCTTGAAAAAGTTCCCTCATT 
TT7TCTTTTAAAATCCCATTMCMTCAATG 

TGTCCATTTAAAGAAGAATATGGATCTTGGAAAATCATTTGAAT A l I I I I I CTTAAAAAT 
AGATTTTG I INN I GTTAATTTGTTTCCAGAAATAACTTCATCTAATAAAGTAACTTGA 
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fcCACCGTMGATTCATATAATCTAATCAAAGATCTACCAACTGGTTGTTTTACCCTGGAT 
CCAGATTCACCAATTAAACCTAAAA I I I I I CCTTGTTGGATCTAAAATTTTACCGTTATC 
AACAGCTTTGACAACCCAAANTTATTGGAAAAATA I 1 1 ! I i I AAGCTTTGAACTTTTAAG 
GGTGGGTTAACTTGCATTA 
Sequence 1592 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGG 

GTTTCCCTGCTATGTTTTGTCACACATCATTGCTGGGAGAGTTCAGTGCTATCCAAGTGA 

CTCCCCGGGGAGAGGACAACTGGAAATTTATGCCTGGTGTCTTCCTGGACTCCGACCTAT 

GCACCTTTTTCCTTGGATGATTTTMCCTGTCTCCTTTTGCTGCAATGAACCATAACCAT 

GAGTATATCAGCTTTTCTMTTTTTGTGAAGTCCCTCAAGTGAATTATCAAACCTAAGGG 

TAGTTTTGGGGACCCCCATCACAGAAGAGGTCAAACAGGGGGCAGG 

Sequence 1593 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAA 
ATTATTATTTATCTATACAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGC 
TAAAAAAAAAAAAAAAAAAAGAGGGAGAATGAAAACTTCCGGACGCGTGGGTCGAAGCTT 
GACCT 

Sequence 1594 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGGCTGTCTACACTGACATCATCCAGGGCAAGCT 

GGACCAGCGAAACCAGCTGCTGGAAGTGGATTTCTGCATTGGCCGTGACATCCGAAAGAA 

GGATATCAATAATATTGTCAAGACCCTGCATGAATGGTGTGATGGCTGTGAAGCAGTTCT 

ACTGGGCATNGAGCAGCAAGTTCTGAGAGCCAACCAGTACCTT 

Sequence 1595 

CCGCGGTGGCGGCCGCCCGGGCAGG I I I I I I I I I I II I I I I CTTTCTGTTCCTTGGACTA 

GATMTCTGMATCMCTGTCTTCAGTTTTGCAGACTCTTGTGCCAGCTAAAATGTTCTG 

TTGAGCCCCAGAAGCTAATTTTCTTTTC^ 

TTTTTAATATAATTTCTACCTC I 1 1 1 1 I ATATTCTCCATTTGGTGAGACATTCACATACT 
TTCTTCCAG I I I I I I 1 1 AG ACGTAGTTTCCTTGAGTTCTTTGAGCATATTTAAAATAGTT 
GATTTAMGTATTTGTCTAGTTACTCCACCATCTGAGTTTCCTCAGGGAAMTTTCTATT 
GCCTCCTTTTTTCCTGTGTGTGGTCGGNCCATACGGACGCGTGGGTCGAAGACCT 
Sequence 1596 

ACTTNNTN I I I I I I I I I I I I I I I 1 1 I I I AAGCGCCCGGCATTTTCTAAATAAAATCATTT 
TATTTGGNAAAAGGGTTTTMCAGNTATACCTTTCTAGCTAAAAGAAAAGAAATAGCGGG 
ATGTACCT 
Sequence 1597 

AGGTACGAAGAGAAAGGAATCAAAGCCTACTANCTCAAAAAATTGTCAAATTGCAAATGA 

GGACATCTAGAGAGGAAGAAAGGAAAAAAGGAACTAAAAAACAGAAACAATTAACAGTAA 

GTTCTTAACTATCAATAATTATTTTAAAAGTAAATAGATTAAATTATCTAATCAAAAGAC 

ATTGAATGGCTGAATGGATTAAAAAACAAGATCAACTATACATTGCCCATCAGAGATTCA 

TTTTAGCTTTAAGGATAAACACTGGTTGAAAGTGAAAAAAGTCAAGGCTGGGCATGGTGG 

CTCATGTCTATAATTCCAGCACTTTGGGAGGCCAAGGTGGGCAGATAATCTGAGGTCAGG 

AGTTTGA 

Sequence 1598 

CCGGGCAGGTACCACCTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 

TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCGGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAGCAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTAT 

Sequence 1599 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACAGGACCAATGCTGCC 



# 
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CATCCACATGGAATTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGT 

GGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATAC 

TTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGG 

GAAATCCATGCCATATGACTTTGATATTCGTGTGCCI I 1 1 1 1 I ATTCGTGGTCCAAGTGT 

AGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACA7TGACTTGGCCCCCACGATCCT 

GGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCT 

GGACCCAGAAAAGCCAG 

Sequence 1600 

TCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTCACTGTCTCATATAATCNCAGCCTCC 

TGTGTGATAGCTGGTGTCATCTCCACTTACAGATGAGGAAACTGAGGATAAGCAGGGTTG 

AATAACTTGCTCGAGATCACAGAGCCACGGGTGGNGAAACAGGATACAAACCTGGTTCTG 

TTTGACTCTAAGACCATTCATNTTTCCTCTGAAACTCAGTATTGCACAGTGTAGAAATGC 

AGTTTTTAAGACCTCCCAAAGTGACGTGCTGNGTCACTGCCCATCATTAGCTANATTGAG 

TMATTGCTGCTTAGCCCCANTTGTTTTGACAGAATCAATAGCCCTTGCTGAGGGGCCAN 

CAGCCTACGGACACAGGAGCATGCTTCATGGGCAAGACCACCATGCACACTCAGAGGGGA 

AGCCACAAGGCAACCTCCACGCCACTTAAGATTTGTAGGGCTCTGAACACATCACCAGAT 

ACAGACCACCTACTTATTTTTTNCACTGTAATANCAAAGGCAGGMTCTTTTTNCTGTAG 

GGTAAAGTTTGGGGG 

Sequence 1601 

GGCAGGTACAAGGCCCCAAAGAGGAGGAATTCCTTGTAGAGGAGCTTGTAGATGCTTCCC 

CTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTGAGAGTG 

TATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCCCCAGGG 

CTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCCACACCCCAGCCTTCA 

CCCTGCGTCAAGTGGACAAG 

Sequence 1602 

AGGTACCAGTGGGGACTTCTGAAAGAACNNTACTNGTGTCAGTGGAAAAGCTGGCATTTT 

GGGAAATGCTGGTCTCTCTCAGTCCAGGAGTCMGGAATATGTTGACTCTCTCTTAATTT 

TTGTAGTCTCAGAGGAAACAGACATTGATGTGGAAACAGTTGTATGCCCCATGGTGGAGG 

TGGTATCCATNGGAGCTGTGGCCTTGGTTTTTTCTGAGTCAGCTAGGACAGAGGATTGTG 

ACCCATGTCCAGAACTGGTGGTTTCCACATTAGTCGCTGCTGTGCTTGTGGAAGGATGCA 

TGGCTTCTATAGCTGTGGTGTCTTCATCTGTTGTCAGTATCTCATGTGAGGNACCTGCCC 

G 

Sequence 1603 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

GGCTGCAGTMGTTTGTCTTMGAMGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTTTCCTCA 

Sequence 1604 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCACCTGAAGGCCCTCACACTC 

AACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAGCTACATTC 

AACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGAGCAGCATG 

GGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCC 

ACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGCTGGACATA 

CAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGGTGTCACCCAACTGGGCTTCTA 

TTGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAAAATTTATCAATCC 

GGGGGCGAGGTACCTGCCCCGGGCGGGCCGCTTAAAACTAGGNGGGATCCCCCNGGCTTG 

CAGGAATTTCGATATTCAAGCTTATCGATACCCGTCCNACCTTCGAGGGGGGGG 

Sequence 1605 

CCGGGCAGGTACCACNTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 
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TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCTGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAACAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTATCAATCCGGGGCGAGTACCT 

Sequence 1606 

CGGCCGCCCGGGCAGGMCNNNNTTTTTTGGGGGGGGGAAAACCNAGACGGAGCCNCGCN 

CAANGGCCCAGGCGGGAGTGNAAGGGCACCAGGGGGGGCNCACCACAAANACCGCCGCCC 

G GGNGAAAGCC ACNCNCCGGCCNNAGCCNCCGGAGNAACGGGGGGAACAGGGGCAGGCCA 

TNI 1 1 1 1 I I I I IGNGGGGGGNGNANGGGGNGGANNCNCCAGGNAAAAANCANGCNGGCCA 

GGGGGGGGGGGAACNCCNGACCTNATGANGCACCCGCCNNGGNCNCCCAAAANGCGGGGA 

NNANAGGGGNGAGCCACCGNGCCNAGCNGACGG 

Sequence 1607 

CGAGTTACCAGAAGGAGAGATCACCACCATCGAGATCCACCGCACTAACCCGTACATCCA 

GTTAGGAATCAGCATCGTTGGCGGCAATGAGACGCCACTGATCAACATCGTNATTCAGGA 

AGTNTACCGGGATGGGGCCATCGCCAGAGATGGAAGGCTCCTTGCCGGAGACCAGATTCT 

TNAGGTCAACAACTGTGATATCATGCAACGTGTCCCATAACTACGCCCGGGCTGNCCTTT 

CCCAGCCCTGCAGNACCCTGCACCTGACAGNGCTTCGGGAGCGGCNGCTTNGGCAGTCGT 

GCAAA 

Sequence 1608 

CGAGCCTTTAGATGGCGTCTCCTCAGGGGGGCCAGATTGCGATCGCGATGAGGCTTNGGA 

ACCAGCTCCAGTCAGTGTACAAGATGGACCCGCTACGGAACGAGGAGGAGGTTCGAGTGA 

AGATCAAAGACTTGAATGAACACATTTGTTTGCTGCCTATGCGCCGGCTACTTNGNGGAT 

GCCACCACCATCACAGAGTGTCTTCATACTTTCTGCAAGAGTTGTATTGTGAAGTACCTN 

CAAACTAGCAAGTACTGCCCCATGTGCAACATTAAGATCCACGAGACACAGCCACTGCTC 

AACCTNAAACTGGACCGGGTCATGCNGGACATCGTGTATAAGCTGGTGCCTGGCTT 

Sequence 1609 

GCGTCCGACGTCCCCCAGGAGAATGGTAGACACAGATGAGGAAATTGTGGAGATGGGCAC 

AAACCGCAAGGTGAAGAAAACGAACAAACACCGAGTTGATACGGATAGTCCCCGTTCCCC 

TGAGGGCCGACCCCGTGACTCCCGATGAGCGTCCAGTTGCGCCGGGCATCCCTGGGCCTC 

CCAGCGTCTTTCCCGGAGGTTCATCGCCGCACGGCGGAAAGCGCTCTCGGTTCCGCTTTC 

CGGCCCCAGCCTCCCGGGCGCCCTCGCGCGGCGGCTAACGCTGGTCCTCGGCCGGGCGCG 

CTGACGTCATCGTGCGTCAGAGTGAGCCCGGATGGGGCGGCGGGCTTCGGGAGCGCCCGG 

GCTGATCCGAGCCGAGCGGGCCGTATCTNCTTGTCGGCGCCGCTGATTCCCGGCTCTGCG 

GAGGCCTCTAGGCAGCCGCGCAGCTTNCGTGTTTGCTGCGCCCGCACTGCGATTTACAAC 

CCTGAAGAATCTTCCTATCCCTAT 

Sequence 1610 

CGCGTCCGGCGGCGGCGGCTGAGGAGGGCCCGGCCTGCGAGAGCCTCAGTGGGAGCCGGC 

TCAGCCCTCGGCCACCATGTCGGCGCCGTCGGAGGAGGAGGAGTACTGCGCGGCTGGTGA 

TGGAGGCGCAGCCGGAGTGGCTGCGCGCCNAGGTGAAGCGGNTGTCCCACGAGCTGGCCG 

AGACCACNCGTGAGAAGATCCAGGCGGCCGAGTACGGGCTGGCGGTGCTCGAGGAGAAGC 

ACCAGCTCAAGCTGCAGTTCGAGGAGCTCGAGGTGGACTATGAGGCTATCCGCAGCGAGA 

TGGAGCANCTCAAGGAGGCCTTTGGACAAGCACACACAAACCACAAGAAGGTGG 

Sequence 1611 

CGCGTTCGAGTCTGGAGACGACGTTNCGAAATGGCACCTCGCAAAGGGGAACGGAAAAGA 

AGGAATGAACAGGTCATCAGCCTTGGACCTCAGGTGGCTGAAGGAGAGAATGTATTTGGN 

GTCTGCCACATCTTTGCATTCTTCMTGATACCTTTGTCCATGTTANTGAACTTTCTGGC 

NAGTGAGTACTTCAGAAAGGCATNAAACANGCCTCAAAGGGAC 

Sequence 1612 

CCCCGCGTCCGCCCACGCGTCCGGGCTCGGCTGCACCGGGGGGATCGCGCCTGGCAGACC 
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CCAGACCGAGCAGAGGCGACCCAGCGCGCTCGGGAGAGGCTGCACCGCCGCGCCCCCGCC 

TAGCCCTTCCGGATCCTGCGCGCANAAAAGTTTCATTTGCTGTATGCCATCCTCGAGAGC 

TGTCTAGGTTAACCGTTCGCACTCTGTGTATATAACCTCGACAGTCTTGGCACCTAACGT 

GCTGTGCGTAGCTGCTCCTTTGGTTGAATCCCCAGGCCCTTGTTGGGGCACAAGGTGGCA 

GGATGTCTCAGTGGTACGAACTTCAGCAGCTTGACTCAAAATTCCTGGAGCAGGTTCACC 

AGCTTTATGATGACAGTTTTCNCATGGAAATNNGACAGTACCTGGCACAGTGGTTAGAAA 

AGO 

Sequence 1613 

GTTNAGTNGMGTTCTCTACCATTGAATCAGTGAACTAGAAAGATCTGATTTGGCCTGGG 

ACCAGTGTTCMGTTGGTTTGGTC7TTATTAAAAATCACAATATTCCGAAAACAAAAAAA 

CCTAGGAGATAMTGTAGAGGTATTGACTTTTCGTATCTTTTATCTTCACACTGAAACAA 

GAGCTATCCTATTTGATTATTAAAGTGAGCTATGTGTTAAGTGCCAGGACATTTCTAGCT 

TTTGTGAGAATGTGTCTACATATGAGTATAATAAACCCACATGTATACACAATTGTCTCT 

TATGTACTCCTACCTGGCAGGAGTCTTTG 

Sequence 1614 

CGCGCCGGTGGTGCGATCTCGGCTACTGCAACTTCAGCCTCCTGGATTCAGGCAACACTC 

C TGCCTCA GCCTCCCACGTGGCTGGGATTACAGGTGCCTGCCCCCATGGCTAAI M I N G 

TATTTTTTGTAGAGATGGGGTTTCACCATGTTGGCTGGGCTGGTCTCACTCTCCTGACCT 

CAAGCAATCTGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCC 

CCCAGCCTGAGCCI I I I I I I I I I I CTAATGCATCCAAGGTTAAGGGGAAGACGCAAATAA 

CAGGACTATTCTAAAAGGAAACCTGTTTGAACTCTGTGAGATCAAGTCATCAGTCTCAGT 

AT TNCA CAGGCACAC CTTM TTTCATTGGTAAMGATATATATATTTTGNCTATTTTTGN 

GCTTTTGGGGGCCTATTTTGNGCTTTTTTACCTTTMTGNMGMGANCTTM 

GTGGAI I I 1 1 I IACCA 

Sequence 1615 

TCGCCNCGCGTCCGTAGAAACTCACACTAGACACACGCAGTAGTCATACGTCTTCACACG 

GTTTAGGAGCTACTGGACCAACATTCTTGT1 I NGCTTTTGM I I I I I AAATAATTCTAG 

TCTGGAGCTAACTGTGGAGCAGCCAAATAGTAGCTGGCATGTTGATTCAAACCATGGGCT 

GMTTTGCTCATAGGCTGTGCATCAGACMMGCTTGMTATTTGTGTTGTATGCTTGTT 

CCAACCACCGQTTGTG TGAGCAT TTTTGTGGCTTGTACAGAMGTACACTTTTAAATTGT 

CTCT TGCATCACTAMATTTTTTTAAAATGAGCATAACAACGAAAGGCATCCAGCTGACT 

TTTTGATTCCMGATTATTGATTGGATTGACTTTTTTGCATTAMTTT^ 

TAMTCATATGGCGAGTCAGGGMTAAAAAAGTCAAAMGGAAACMATAGMGCTTTTT 

llll! IAAAAA 

Sequence 1616 

GGNCGAGCGCGCCTTGCGGGGGCGGTATCCCGGCGCCCTAAGACCCCACGACCNCNNGCA 
CCGGCCGNTGCTGCNAGACCCCGGCCCGNGTCGGTCCGATGTCGCCCCCCGGNCCCGGCG 
GAAACGCCTCCCTNCTGGCCAGGCTGTTCNAGCACCCGCTT 
Sequence 1617 

TCGCCACGCGTCCGTTNAGAT GCAGA MTGMMAAAMCACCTTTGTTTTATAAATATC 

MAGTACATGCTTAMGCCAAGTTrTTATCTAGTTTATTCTAGTACTTAGCTTGCCTGGA 

ATAGCTAATAMTTATTCATGTATGTGCTTTTGAAAATCCAGAGCCCTATTTTTACACAC 

TTGTGTGMGTTGGCAAACATTTTGAAAMTGGAAAMAGTTTCTAATAATTGGGAACAA 

TTACATTMTTMTATTTTGTAAMTATTGMGCTTTTAGCCCTATGTCAATTTGTAGAT 

TAAAATAAATTAATTATAGGAAAGGAAGATAACAGTGAGAAACCAAACATTAC 

Sequence 1618 

CACGCGTCCGCCCACGCGTCCGGCGCGGCGGGAGCGGGCGGNGCGAGCGGGAGGCGGCGG 

CTCACAACTGAAGCGCTGNGGCATGGNGCGCGCCTGCCTCCAGGCCGANNNGTACCTNAT 

GTTCGCCTTCAACCTGCTNTTCTGGCTGGGAGGNNGTGTGNGGGCTGGNTGTCTGGCNTC 

CTGGCTTGGGC 

Sequence 1619 
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TCGCCACGCGTCCGGACCTGGATGTGAGGGTGAANGGGCTGTGCTGCTGGGAGCCACATT 

CCTCATTGACTACATGTTCTTTGAGAAGCGAGGAGGCAGCTGGGCCCTCTGCCATCACCA 

GTTAGAGGCCACCATGGTGTGAGGAGACCATCACCTCGACCAGAACTCCAGATGGTCACC 

TGCCCTGGCCCCTCCTCTGGGCAGCCCCTTTCCTCCATGTACACTGCAGGGGACAGAAGG 

GGGGCCCCATCCCTACCCTACTCCCTGGCCGCCTGCCCCTGTGGTTCCCAAGGAAGGGGG 

TATTGTATTGAGAGCCGCTCTCCTGCTACCTCCCACCACTGTCCAACAGTCCCTCGGCAC 

ACAGGCATATTAAGCTTTCACACTTTTCCCATGCAC I I 1 I I ICCACCCC 

Sequence 1620 

GGAGTCGACCNCGCGTCCGGGGGCTTGCTGGGATCATGGCGGAGAATCACTGCGAGCTCC 

TGTCGCCGGCCCGGGGCGGCATCGGGGCGGGGCTGGGGGGCGGCCTGTGCCGCCGCTGCA 

GCGCTGGGCTCGGCGCCCTGGCCCAGCGCCCTGGCAGCGTGTCCAAGTGGGTCCGACTCA 

ACGTCGGCGGCACCTACTTCCTCACCACTCGGCAGACCCTGTGCCGGGACCCGAAATCCT 

TCCTGTACCGCTTATGCCAGGCCGATCCX^GACCTGGACTCAGACAAGGATGAAACAGGCG 

CCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAACTACCTGAGACACG 

GCMGCTGGTGATTAACAAAGACCTCNCGGAGGAAGGG 

Sequence 1621 

GTCGCCCCGCGTCCGGGGCCCGCGGGCCTCGCCTCCGCCCTCCGCCACCTCGAGCTGCGG 

TAGCAGCGACTCATGAGAGCGCGGCCGGAGGACAGATTTGATAATGGGCTGCATTAAAAG 

TAAAGAAAACAAAAGTCCAGCCATTAAATACAGACCTGAAAATACTCCAGAGCCTGTCAG 

TACAAGTGTGAGCCATTATGGAGCAGAACCCACTACAGTGTCACCATGTCCGTCATCTTC 

AGCAAAGGGAACAGCAGTTMTTTCAGCAGTCTTTCCATGACACCATTTGGAGGATCCTC 

AGGGGTAACGCCTTTTGGGAGGTGCATCTTCCTCATTTTCAGTGGTGCCAAGTTCATATC 

CTGCTGGTTTAACAGGGNGGNGGTACTATATTTTGNGGCCTTATATGATTATGAAGCTAG 

MCTCCAGAAAGACCTTTCATTTAAGAAGGGTGAAAGATTTCAAATAATTAACAATACNG 

AAGGAGATTGGTGG 

Sequence 1622 

TTCGGGAGTCGCCCCGCGTCCGCTTTTAGAAAAGGCCAATATACCTATCACACTTTGGAA 

GTAAAAATACACACTTTCGTGTGTACCTAAAAAAAAAATCGTTGAAAATCAAGGCCAAAG 

GTAGTGCAATTTTTTCATTAAGATTTAAMAAMGGGAATGATAGTCTTTGAAAGAAAAC 

AGTAGGCATCCAGCACTGGACAAAACATGGGTATCAAAGATGAATAATCTTTGGAGATTC 

TGGCAGTGTTTTCCCAGAACMGTCAAGTGGAMGTGGAGAMTTATCTGTATMTTTTG 

GACACATACAATGGCAGTTTATCAMGGGTTTTGTTCTGTGGCCTGAATTTACTGGGGTC 

CTACCTACACATTGAACATGTTTTGGCTGGC I I I 1 1 1 1 I 1 1 » 1 N I CAACTTGCCAGTTT 

CACTTTACATGGTTAGTAATAAATGGTTTCCACGGGGTGAGTTGGGATAAMTTNTTNAA 

AACATNTTAAATTCCA 

Sequence 1623 

GGAGTCGACCNCGCGTCCGAGCCGGGCCGGGGCGATGTGGAGCGCGGGCCGCGGCGGGGC 

TGCCTGGCCGGTGCTGTTGGGGCTGCTGCTGGCGCTGTTAGTGCCGGGCGGTGGTGCCGC 

CAAGACCGGTGCGGAGCTCGTGACCTGCGGGTCGGTGCTGAAGCTGCTCAATACGCACCA 

CCGCGTGCGGCTGCACTCGCACGACATCAAATACGGATCCGGCAGCGGCCAGCAATCGGT 

GACCGGCGTAGAGGCCGTCGGACGACGCCAATAGCTACTGGCGGATCCGCGGCGGCTCGG 

AGGGCCGGGTGCCCGCGCGGGTCC 

Sequence 1624 

CGCGTCCGGGGCAGCCGCGCCCGCGGAGTTTTCCGCCCGGCGCTGACGGCTGCTGCGCCC 

GCGGCTCCCCAGTGCCCCGAGTGCCCCGCGGGCCCCGCGAGCGGGAGTGGGACCCAGCCC 

CTAGGCAGAACCCAGGCGCCGCGCCCGGGACGCCCGCGGAGAGAGCCACTCCCGCCCACG 

TCCCATTTCGCCCCTCGCGTCCGGAGTCCCCGTGGCCAGATCTAACCATGAGCTACCCTG 

GCTATCCCCCGCCCCCAGGTGGCTACCCACCAGCTGCACCAGGTGGGTGGTCCCTGGGGG 

AGGTGCTGCCTACCCTCCTCCGCCCAGCATGCCCCCCATCGGGCTGGATTAACGTGGCCA 

CCTATGCGGGGGCAAGTTCAACCAGGGACTATCTTCTCGGGAATGGCGGCCAACATTGTC 

TGGGGACATTTGGAGGGAGCCAACATGCCCAAACCTGGACCCTGGGGCCCCTGGGGGCTG 
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Sequence 1625 

CACGCGTCCGGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGCGGCCTCCCGGCGGGAGC 

GAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGGCGGCTGCGGGCG 

CAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGGCGGCGGT 

CCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCGGC 

TCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAAGATGTTCCGCGAGGTTGAGGAAC 

TGATGGAGGACACGCAGCACAAATTGCGCGCGCGGTGGAAGAGATGGAGGCAGAAGAAGC 

TGCTGCTAAAGCATCATCAAGAAGTGGAACCTGGCAAACTTAC 

Sequence 1626 

CCACGCGTCCGGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAGGAAGCGGCAAAG 

GGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTGCTCTGGATC 

ACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCCGACGGGACCTGA 

AGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGATGA 

GCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAAGCTGTATG 

CTTTCAACCAGCGGGAGAGTGAGGCGGGATCTCCAGCAATCGGGCCATCCCGGACACTCG 

CCATCTGGAGATGCACATGGCTTGGTGTATTGGACGGGACCTTCCACCCACTT 

Sequence 1627 

GCCACGCGTCCGCCGCCCGCTTGCCCGTCGGTCGCTAGCTCGCTCGGTGCGCGTCGTCCC 

GCTCCATGGCGCTCTTCGTGCGGCTGCTGGCTCTCGCCCTGGCTCTGGCCCTGGGCCCCG 

CCGCGACCCTGGCGGGTCCCGCCAAGTCGCCCTACCAGCTGGTGCTGCAGCACAAGCAGG 

CTCCGGGGCCGCCAGCACGGCCCCAACGTGTGTGCTGTGCAGAAGGTTATTGGCACTAAT 

AGGAAGTACTTCACCAACTGCAAGCAAGTGGTACCAAAGGAAAATCTGTGGCAAATCAAC 

AGTCATCAGCTACGAGTGCTGTCCTGGATATGAAAAGGTCCCTGGGAAGGANGGGGGCTT 

GTCCAAGCAAGCCCTACCACTCTCAAACCTTTACGAGACCCTGGGNAGTCGNTTGGATCC 

ACCACCACTCAAGCTGTACACCGACCGCACGGAGAAGCTGAGGCTGAATGGGGAGGGGCC 

Sequence 1628 

CCTAAGGGCAACAAGGGCGGTCCTGGCCAGCCGGGCTTTGAGGGAGAGCAGGGGACCAGA 

GGTGCACAGGGCCCAGCTGGTCCTGCTGGTCCTCCAGGGCTGATAGGAGAACAAGGCATT 

TCTGGACCTCGGGGAAGCGGAGGTGCCGCTGGTGCTCCTGGAGAACGAGGCAGAACCCGG 

TCCACTGGGAAGAAAGGGTGAGCCCGGAGAGCCAGGACCAAAAGGAGGAATCGGCAACCG 

GGGCCCTCGTGGGGAGACGGGAGATGACGGGAGAGACCGGAGTTGGCAGTGAAGGACGCA 

GAGGCAAAAAAGGAGAAAGAGGATTCCCTGGATACCCAGGACCAAAGGGTAACCCAGGTN 

AACCTGGGCTAAATGGAACAACAGGGACCCAAAGGCATTNAGAGGCCCGAAGGGGA 

Sequence 1629 

AGTCGCCCCGCGTCCGCTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAG 

CAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTG 

GAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGT 

GAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTC 

AGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTT 

AGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGGAACCTGGAAGATACCCTTCAAGGAG 

TCAGCAGAGCACACCCAAAACMCTGTTTTCTTCTAAAACTGGGTAGAAGCCTAAGAAAG 

ATAAACTCTGCTCCTCCTACAGAAACAAAATCTT 

Sequence 1630 

TNCGGGCCTGGTGAGCACCGCCGAGGCGCGGGCCAGCTCTTCGAGGTTGTGCGCGGGAGT 

GGCACGGCGGGCCGGGCCGAGCGAGGGGCTAACTTCAGCGGTGGCACCGGGATCGGTTGC 

CTTGAGCCTGAAATCATGACCACCCCAGGAAAAGAGAACTTTCGCCTGAAAAGTTACAAG 

AACAAATCTCTGAATCCCGATGAGATGCGCAGGAGGAGGGAGGAAGAAGGACTGCAGTTA 

CGAAAGCAGAAAAGAGAAGAGCAGTTATTCAAGCGGAGAAATGTTGCTACAGCAGAAGAA 

GAAACAGAAGAAGAAGTTATGTCAGATGGAGGCTTTCATGAGGCTCAAGATTAATAACAT 

GGGAGATGGCCCAGGGTGGGTGTCATCACTTCTGACATGATTGAAATGATATTTTCCAAA 
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AGCCCAGAGCMCMGCTTTCAGCAACACAAGAAATTCAGGAAGCTGCTTTCAAAAAGAA 
CCTAA 

Sequence 1631 

CGGAGTCACCACGCGTCCGGGGCTGCCAAGGGAGGAGGAAGATGGCGGCGGGGGCGAGGT 

GAGGTGTTGGCAGTGGAAAGGGGTTCGGGCTCGGGGGGCGGGGGGACGCGGAGCGATGGC 

CCGCGCCGGCCGCAGGGGCGGATAAAAAGCCGTCGCGCTGCGGGAGTGGGCGGGAGGGAG 

AGGGGGTGTCCGAGGGCCACAAGAGTATGACGGGGCTGTACGAGCTGGTGTGGCGGGTGC 

TGCACGCGCTGCTCTGTCTGCACCGCACGCTCACCTCCTGGCTCCGCGTTCGGTTCGGCA 

CCTGGAACTGGGATCTGGCGGCGCTGCTGCCGCGCCGCCTCTGCGCGGTCCTAGCGCCGC 

TCGGCTTCACGCTCCGCAAGCCCCCGGCAGTCGGCAGGAACCGCCGTCACCACCGGCACC 

CGCGCGGGGGGGTCGTGCCTGG 

Sequence 1632 

CGTCCGTTTGTTTAATA I I I I 1 1 I ICTCTCTTGAACAAAACTGAGATAATTTAGAAAACA 

GGTGCTTAATTGCAATAAAATTACTATGAAGTATATTAAAAATCACGACATTGTAAAATC 

TCACTTTAGATCATCAAAGAAMCCATTGTTACTATCTGCTTTGAGCTTAGGAAAATGTA 

CAAGAGAACAMTTAAMTTGAAAMTTGATTTCACTTAGAAAAACTTCTAGGAACAGGG 

TGAACCACTGATTTTMTTTGCCTAATTATCTTATGACAAGTATCAAATTAAGATGACAC 

TTAMGGATCCTTAGCATTTAACTTAATGATGGAGAAAGAGTGCTCAATAGGACAGTTCC 

CCAGTTAAGGGGTMTGGAGATGCCCATTTTCAGGAGGACCATTCTAAGMGATATTTTT 

G G ATTCATT AAAAAC ATTT AAATAAAA AG CCCTTCTTCAAG ATTG G G AAC 

Sequence 1633 

CGCGTCCGCCGGCCTGGTGCACGCGGCGCACCGAGGCCTCCCGCAACGCCGCCGACAAGG 

AGCGGGCGGCGGGCGGCGGCGCCGGCAGCAGCGAGGACGACGCGCAGAGCCGCCGCGACG 

AGCAGGACGACGACGACAAGGGCGACTCCAAGGAAACGCGGCTGACCCTGATGGAGGAAG 

TGCTCCTGCTGGGCCTCAAGGACCGCGAGGGTTACACATCATTTTGGAATGACTGTATAT 

CATCTGGATTACGTGGCTGTATGTTAATTGAATTAGCATTGAGAGGAAGGTTACAACTAG 

AGGCTTGTGGAATGAGACGTAAAAGTCTATTAACAAG 

Sequence 1634 

CCCCGCGTCCCGGTTGGCCGGGCGGAGGTCTTCGCTGAGGCCCGGGGCGGGGTGGCGCCA 

CCCCTGATTGCC3GTGCCACGGACTGCTCCTGCTGGGCGGAGAGGACAGATTTTGCAAAGC 

GGAGGCTTGCGACGGGTCCTGCAGGGGGACAGTGAGGAAAGGGCCCGCCTCGTNTCCGCT 

CCTGGGGGACCCGCAGAAATAAGAATCAAACTCCACAATGACAACCTATTTGGAATTCAT 

TCAACAAAAATGAAGAACGAGAATGGGAGTCCCGATTTAGT 

Sequence 1635 

CCACGCGTCCGGGCGGGGCCATCCAAGCAACGCTGAAGGCCTTTTCCAGCAGCTGGGAGC 

TCCCGGATTGCGTGGCACAGCTGAGGGGCCTCTGTGATGGCTGAGCTCTCTTATGTCCTA 

TACTCACATCAGACATGTGATCATAGTCCCAGAGACAGAGTTGAGGTCTCGAAGAAAAGA 

TCCATGATCGGCTTTCTCCTGGGGCCCCTCCAATTGTTTACTGTTAGAAAGTGGGAATGG 

GGTCCCTAGCAGACTTGCCTGGAAGGAGCCTATTATAGAGGGGTTGGTTTATGTTGGGGA 

GAATTGGGCCTGMTTTCTCCACAGAAATAAGTTGCCATCCTCAGGTTGGCCCTTTCCCA 

AGCACTGTAAGTGAGTGGGGTCAGGCAAAGCCCCAAATGAGGGGTTGGTTTAGATTCTGA 

CAGTTTGCCAGCCAGGCCCCACCTCAGCGTCTGTCGAACAAACAAAGNTNGGGNGGGTTT 

N 

Sequence 1636 

CCNCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCTTCTGCAGCAAGCTCAGGAGAGCT 

GCTGTCTTCCCTCCCGCCCACCAGCAACGCACCCTCTGACCCTGCCACAACTACTGCAAA 

GGCAGACGCTGCCTCCTCACTCACTGTGGATGTGACGCCCCCCACTGCCAAGGCCCCCAC 

CACCGTTGAGGACAGAGTCGGCGACTCCACCCCAGTCAGCGAGAAGCCTGTTTCTGCGGC 

TGTGGATGCCAATGCTTCTGAGTCACCTTMCTTTGMCCATTCTTTGGAATTGGCGTGG 

TATATTTAACCACGGGAGGCGTGTCTGGAAACGCAAACTATCATTAATTTCATACTAGGT 

TTGTACCGTATCTGTAGGCATTCCTGTAAATAATTCCAAGGGGGAAAACTAAACNGGGAC 
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GTGGGGTTGTATCCTGCCAGGTTTGAGTGGGGGCTCACACCTAGGGTGAGAAGTCAGAAA 

GCGCTTGTATTTTAAACAACCAAAAAGAATTGAAAGGGTG 

Sequence 1637 

CGTCCNATAGGCTTGCACACTTTTCTAACTACATGTTTAAGTGGCAGAGTCCAGGCTGTC 

GAGTCACGGTTGGGT TTGA ATCTGACTCCACCAGTAACTTTGGTTGGAAAAAATCACTTA 

TCCTCTTTAAG CTTGA TTTTATTTATTTTATTTTATGTAAGAGTGAGACAGTAGT^ 

MTAGGGTTGCTTTTAAATTAGAGTGAACATGAGGCATTTATTCGGTGCCAGACAGATAA 

CTGCCTATAACAGGATGTGATCAGCACAAGTAACAGAAAATTAGCCTGGACGGTGGCTTA 

AGCMTGGGGMTGTTTATCTCACATAGCMAMGGTCTGTAMTAGGATGGTTTTAGAG 

TTGGGGTGGGGAAGCCAAAAATGTCATCAGGATTTCCTTGGAACCCGT 

Sequence 1638 

CGCGTCCG GATTAA TACMCTCTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTT 

AGCGTTAAG J 1 1 1 I AACCGTMTTTTMTAGCTTMGATTTTAAGAGAAAATATGAAGAC 

TTAGAAGAGTAGCATGAGGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAGAAGGAAAG 

AAGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAACCCACCATGCCCCAGGCTCAGCAG 

GGAGCTGCTGGATGAGAAAGGGCCTGAAGTCTTGCAGGACTCACTGGATAGAAGTTATTC 

MCTCCTTCAGGTTGTCTTGMCTGACTGACTCATGCCAGCCCTACAGAAGTGCCTTTTA 

CGTATTGGAGCAACAGCCGTGTTGGCTTGGCTGTTGACATGGATGAAATTG 

Sequence 1639 

CGCGTCCGGCTCCCCGCACCCCCTCGCACTCNCTCTGGCCGGNCCAGGGCGGCCTTCAGC 

CCAACCTTGCCCAGCCCCACGGGCGCCACGGAACCCGCTNGATCTCGCCGCCAACTGGTA 

GACA 

Sequence 1640 

GTCGCCACGCGTCCGGCGGCCGGGCCGGGGCAGCCGGGAAGCGGGTGGGGTGGTGTGTTA 
CCCAGTAGCTNCTGGGACATCGNTCGGGTACGCTCCACGCCGTCNCAGCCACTGCTGTGG 
TCGCCGGTC 
Sequence 1641 

CGTCCGCTCCTCCCGCCTGAGGCGAGTCTGGGCTCAGCCTAGAGCTCTCCGGCGGCGGCG 

CAGCTTCAGGGCAGCGCGGGCTGCAGCGGCGGCGGCGGTTAGGGCTGTGTAGGGCGAGGC 

CTCCCCCTTCCTCCTCGCCATCCTACTCCTCCCTCCTCGTCATCCTCCCCCTTCGTCCTC 

CTCGCCTTCCTCCTCCTCGTCAGGCTCGACCCAGCTGTGAGCGGCAAAGATGGGCGGCGC 

CCAGGCCGCCGCCTGCCAGGCTGTCGGGGCGTCATGGTGCCGGCGCCCATCCAAGACCTG 

GAGGCCCTGCGCGCGCTCACGGCGCTCTTCAAAGGAGCAGCGGAACCGAGAAACAGCACC 

CAGGACTATCTTCAAAAGGAGTTCTGGATATCCTAAAGAAATCTTTCTATGCTGTTGAGC 

TTGCCTGCANGAGATCCATCCCAAGTGGAAAACCT 

Sequence 1642 

ACATTTATCATGGATGCTGACCGGGAGAAAGAAAGAAAGAAACGGGAGGAGCGGGAGCGT 

AAGCGGCGGAAGGAGGAGGAGGTGCAACAGCCAAAGTTGGCAGAGGAGAGACGGCGGCAG 

AATTTACAGGAGGAAAAGGAAAGGAAGTTGGAATGCCTGCCCCCTGAACCTTCCCCTGAT 

GACCCTGAAAGTGTCAAGATCATCTTCAAATTACCTAATGATTCTCGAGTAGAGAGACGA 

TTCCACTTTTCACAGTCTCTMCAGTMTCCACGACTTCTTATTCTCCTTGAAGGAAAGC 

CCAGAAAAGTTTCMGATTGMGCCMTTTTCCCAGGCGAGTGCTGCCCTGCATCCCTTC 

AGAGGAGTGGCCCAATCCCCCTACGCTACAGGAGGCCGGACTTAGCCACCAGAAGNTCTT 

TTTGTTCAGGACCTMCTGACGMTGACATTTTTTTCTTTCTGTCCCCTCCTACCCCAGT 

CCCTAAAAGAAATGGGGNAAAAAGGAAACAACAGCAGTCNTAAAAAAAAAA 

Sequence 1643 

CGCGTCCGGAGGGGCTAAGAAGGTTGTCCTTGCCTAATGCTCTGATCTGTAAGTGAATAG 

GGCAGMCAGTTCAGCCTTGAGGTTAGAATTTAGCAGGAGCTATCCTGACTTAATATCCA 

GTTGTGGGGTTTGCAAAACAAAACAGCTGTATGTAATCATTGCCACTAGTTCCATCTAGA 

ACTCCTTTCTAGTTTGTTATTTTTAAAATGTTTATACATAAAACCACCAAAATACATAGC 

TTCGACAAGATGGAAGTTTATTTCTCTCTCCCATAACAGTGCAGTGATAGTCAGCTGGTC 
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CAGGCCAGGCAAGGGGCTGGTCCATGATGTCATCAGGCACCCAGGTTCCTACTGGCTTTG 

CATGTGGCCACAGTTAGCAACAAANGGAGGCTGTAAATTT 

Sequence 1644 

CGCCACGCGTCCGGTGATGCGGACCCCGGGCGGGCGCAGGGCGCGGGCTCCGGCGCCGCC 

GCTGCGTCCTCCCCGGCCGCGGGCGAGCCGCTGCAGAGGGAGCGTCGCGCCGGGGCGGAG 

TGCGGGCTTGCGCGGCAAGTGCGCGCCGAGGTCACGAAATGGATTGGAGTGAACCGGAGA 

CCCCGAAAACGGAAGCGCAGGGAGAAGGAAGAGGTGTTTGAAAAGCTTCTTCCAGACCAG 

CTGGTCTTGCTTCTGGAGCATCTCTTGGAGCAGAAGACTCTGAGCCCCCGAACTCTGCAA 

AGCCTCCAGAGGACATACCACCTCCAGGATCAGGATGCAAGAGGTTCGCCATCGGTGGTG 

TGAACTCATTGTTAAGCACMGTTCACGAAAGCCTACAAAAGTGTGGAGAGGTTCCTTCA 

GGGAGGATCAGGCCATGGGGTGTGTACCTCTACGGGGAGCTGATGGTTGAGTGAGGACCC 

CAGAC 

Sequence 1645 

TCGCCACGCGTCCGGGACATCGAGTNCGGGCTGGCTACGAACTCCTCGGGGGCGAAGGTG 

GCGGAGAGGGATGGGTTCCAGGACGTCCTGGCGCCCGGGGAAGGCTCGGCGGGACGGATT 

TGCGGTGCGCAGCCAGTGCCGTTCGTCCCTCAGGTGCTTGGCGTGATGATCGGGGCCGGA 

GTGGCGGTGGTGGTCACGGCCGTGCTCATCCTCCTGGTGGTGCGGAGGCTGCGAGTGCCA 

AAAACCCCAGCCCCGGATGGCCCCCGGTATCGGTTCCGGAAGAGGGACAAAGTGCTCTTC 

TATGGCCGGAAGATTATGCGGAAGGTGTCACAATCCACCTCCTCCCTCGTGGATACCTCT 

GTCTCCGCCACCTCCCGGCCACGCATGAGGAAGAAACTGAAGATGCTCAACATTGCCAAG 

AAGATCCTGCGCATCCAGAAAGAGACGCCCACGCTGCAGCGGAAGGAGCCCCCGCCCGCA 

GTGCTAGGAAGCTGAC 

Sequence 1646 

TCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGACGAGGGGCCGGGTTCTGCG 

GCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAGCTTCTTCACCGACGGGTC 

CTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTCTAAGCGACACTCAACCTC 

TGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGGCTGGTTGTATTATAAGCA 

GATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACATGCTGGGCTGGATCAGAGC 

GATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCGGCTGTGCCAACCAAGCTCTGAT 

CAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGCTCTGGGCCCAAAGCTGATTC 

CTCCCCCAAAGGCTCTCGCGGCCTGGGGGGCCTCAAGTCTGAGTTCCTCAAGCAGAGTGC 

GGCCACGTGGCCTCANGACTCAAGACCTGCCCGCAGGGAGCAAGGATGACAGTGCTGCAG 

CCCCCAAAACCCC 

Sequence 1647 

ggtgtcgccccgcgtccggtttcttcctaatttatatttccgatacatangtgtagaaca 
ggaatttgcagmgccatttaagttatcttttgaggtmngctctgatttagcatttatt 
ctgataa aatct aatacatcatgggatatatataaagcaacttaattcttgtggtgtagt 
cttaatagttttgaatgttgactgmtc^ 

tccagtgtttctgcctcttggcatgcttamgcacggcttacttcatctgctccttacac 

actaaaatgctg ttagtg tgctcaactacagaaatagccgctgctaagttgatgtagatt 

ttctacttgm tattt ttatggttgtaggmcctcaggagggtcagtgtttactggttta 

tatatgccttctttttcctgtttgagcttctctctttgmgggattctaacagaacaaaa 

gctgctgatcaccctaagttggaaacagnaaaggngtaattaatataacttaatgc 

Sequence 1648 

TCACCACGCGTCCGAAAGTCCGTGACATGGTTCCCCGTGGTGGCCCGTGGCAGCCCGTGG 

CATGGCGTGGCTCAGCTGTCTGTTGAAGTTGTTGCAAGGAAAAGAGGAAACATCTCGGGC 

CTAGTTCAAACCTTTGCCTCAMGCCATCCCCCACCAGACTGCTTAGCGTCTGAGATCCG 

CGTGAAAAGTCCTCTGCCCACGAGAGCAGGGAGTTGGGGCCACGCAGAAATGGCCTCAAG 

GGGACTCTGCTCCACGTGGGGCCAGGCGTGTGACTGACGCTGTCCGACGAAGGCGGCCAC 

GGACGGACGCCAGCACACCGAAGTCACGTGCCMGTGCCTTTGATTCGTTCCTTCTTTCT 

AAAGACGACAGTCTTTGTTGTTAGCACTGMTtATTGAAAATGTCAACCAGATTCTAGAA 
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ACTGCGGTATTCCAGTTCTTCTGACACCGGATGGGTGCTTGGGAACCGTTTGAGCCTTAT 

AGATCATTTACATTCAATTT 

Sequence 1649 

CNCCACGCGTCCGGGGATCCCTGGGGAGAGTAACAGTGGCCCCACATCCCTCCTCCTGGG 

AGACGCTGGTGCAGGGCCTCAGTGGCTTGACTCTCAGCCTAGGCACCAACCAGCCCGGGC 

CTCTGCCTGAAGCGGCACTCCAGCCACAGGAGACAGAGGAGAAGCGCCAGCGAGAGAGGC 

AGCAGGAGAGCAAAATAATGTTTCAGAGGCTGCTCAAGCAGTGGTTAGAGGAAAACTGAG 

ACGTGCACCCCCATGGGATGGAGACCCGAAGGGACTCAGACGGAGCCGCCGTGTTGGCAG 

CGCCTGGGTGTGGGCCCATTTTGGGGACCAAACAGCAAGCTGTGGTCGGATGAGTGCCAG 

GACCTGTGTACCGGGACACGTGGGGAGTCCTCCCAGCATGATGCTTGACTGACCCGAGGA 

AGGTCCTCATGTTTCGTGCCTGTCATTCTCGGATGGCTGTGAGGCATTCCTTGGCAAGGG 

ATGCTTGCGTACCAAGCGGTCCTACCGCATCTACATGGCTTCTGTGATGCATGTTGTCGC 

TTTCCCACCCNGGAT 

Sequence 1650 

CGCGTCCGAGCTTTGCAGGGAAGAACAGAGTATGGGTCATCTCAGCCCCTCATGCCTCGG 

AAGGCTACTACCGCCTCATGATGAGCCTGCTGAAGGACGATGTGTACTGTGAGCTGGCGG 

AGAGGCACATCCAACAGATTGTGCTCTTCCACCAGGCAGGTGAGGAAGGAGGCAAGGTGA 

GAAGGATCACCAGCGAGGGCCAGATCCTGGAGCAGCCCCTGGACCCTAGCCTCATCCCTA 

AGCTGATGAGCTTCCTGAAGCTGGAGAAGGGCAAGTTTGGCATGGTGCTGCTGAAGAAGA 

CGCTGCAGGTGGAGGAGCGCTATCCATATCCCGTTAGGCTGGAAGCCATGTACCGAGGTC 

ATNGACCAAGGCCCCATCCGTAGGATCGAGAAGATCAGGCAGAAGGGCTTTGTCCAGAAA 

ATGTAAGGCCTCTTGGTGTAGAGGGCCANNGTTGGTTGNCTGAGGGGGAATTGACCCGTT 

GGAAGGGGGAAGCAATGAAAGGGCCAAAG 

Sequence 1651 

CGCGTCCGGGATGCGTTGGGTCTGAAAGTCGATGAAGGACGCGATTACCTGCGATAAGCT 

TCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTAACCTAACTGAGC 

MACCTCAGTTGCATTTTGATGAATCCATAGTCAAATTAGCGAGACACGTTGCGAATTGA 

MCATCTTAGTAGCAACAGGAAAAGAAAATAAATAATGATTTCGTCAGTAGTGGCGAGCG 

AA AGCG AAAGAGCCCAAACCTGTAAAAAGGGGTTGTAGGACATNTTACATTGAGTTACAA 

MTTTTATGATAGTAGAAGAAGTTGGAAAGCTTCAACATAGAAGGTGATATTCCTGTATA 

CCGAAATCATAAAATCTCATAGATGTATCCTGAGTAGGGCGGG 

Sequence 1652 

GTCGCCNCGCGTCCGCAACATTATTGAGATTGTCGTGTATAGTCATCGAATATCAGCCAG 

TTCCTGTMTTTTGTGACACGCTCTCTGCCMGCCCACCAAGTATTTCCTTTATAGCTAA 

AAGTTCCATAGTACTAAGGAAATAAAGCAATAAAGACAGTCTCAGCAGCCAGGATTCTGG 

CTGAAGGAAATGATCCGCCACCCTGAGGGTGGTGATGGTAGTTTCTACCCATACCTCAGC 

CTCAGGCGAGTGGCTTATAGCCTCCATTCATGGTGCACTTTATTTATGGTACTAAGATAA 

AGACTGTCAATCCATTGATTTATCTCCTCCTGTCCCCCATCTAAAATACCCATGCTGCTT 

TTCTGGAGTGTTGTGGGGGGGTTACCAGCTTGATCCACTGGTGCTCTTTAAGAAGGCCCA 

AGAAAGGTCTTTGGGGCATTGCCAAAGAAAATCCCGGATTTATGTGGGAAAACCCTCACT 

TTTCTCTTTACNGGCTGGTACCAAGA 

Sequence 1653 

CCGTCCGI I 1 I I I I I I I GAACTTACCGTAAAA I I I III I I I AAAAAGTGCTTGTAAAAAT 

TAAAGAGGAATAAAAGGGGGGTGAACAGCCAGTACGATAGTGCATGCCTGAAATTCCAGT 

GCTTTGGGAGGCCGAGGCAGGAGGATCGTTTGAGGCCAGTAGTTGGAGAGCAGTGTGGGC 

MCGTAGCAAGACCCCATCTCTACAAAAAATTTAAMGTTAGCCGGGCATGGTGATTCAC 

ACCTGGAGTTCCAGCTGCTGTGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAGGAATTT 

GAGGCTGCAGTGAGCCATGATTGCACCACCGCACTTCAGCCTAGGTGACAGAGCAAGGGT 

CTACCTCAGAAAAAAAAAAAAAGGAGGAGCAAGCACGTGTTGATGGGTGGAAATCCAGCC 

AGAAATGCTGAGGCTGAAAAGATTGTTCCCAGTTTNCTGGTAGCCAGGGGAAAAGGGGAA 

ATT 
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Sequence 1654 

CGCGTCCGCTGACATTTCCTAGGAAGCTNGGAAAAGGAAAGTGAAGGAATGGTCTAAAGA 

MTGACCATTCACACTGATTTTGTCTGGACAGTCTGGCCGCAGGTTATTCATAGATTATT 

CAGCCTTTGCAGGACTTGGATTCAGGGTTTTACTCAGTCCCTTTACCTTAGGTGGAATCT 

TCTTAAATTGAMTTTTTGGTMGGMTTCATTTCACAGGTAGTGTTTCAGACTCTGAAA 

GCCCTGACTTGGTTCTTGGCTTCTACTGTACTAGTTACTAGTTACTGGTACTGTTGCCAA 

GCMTCTGCTTGGAATTTGTGGATCCCTGCTGTTCCCTAATCCCACCCCCTGCCCTGAGA 

CAGTGAATGTAGTCCGTGAAGGGAGTGCCTCTCTGGGACCCCCTGTGTTGTTACAGGCTG 

TGCAGTGCAACAATTCCAGCAAAAATACCCTATCCCCGCACTTAGTCATTCGTGGTAACT 

AACAATTTTGAMTACTCATATAAAATGAACAGGAAAGTGGTTAGTG 

Sequence 1655 

GACCACGCGTCCGGACCCAGACCCGGCTGACCCACCTACCCGCGATCCTGCCCATGGCTG 

ACGGGCTCTTTCGGCGCAGACCCTGGGGTCTCGAGCAGATTCGCCCGGACCCCGAGTCCG 

AAGGCCTGTTTGACAAGCCTCCCCCGGAAGACCCTCCCGCTGCCCGCGGGCCCAGGTCGG 

CGTCGGCCGCGGGCAAGAAGGCTGGTCGGCGCGCGGGCGGGAGGGCGCAGGGGGGCCGCG 

CCGGGCAGCCCCCGAAGGCCGCATCGCGCCCCCCGCCCAAGAAGGAGGCGCCTCCACTGG 

ACGAGGGCTGCTATCTCGACCATTTTCCGCACCTCTCCATCTTCATCTACGCAGCCATCG 

CCTTCTCCATCACCTCCTGCATCTTTACCTATATCCATTTACAGCTTGCCTGAGTGGCCA 

GCGCGGGACGGGGTGGGCGCAGGACCGAGCGGGGAGGGAAAGGGGAAAACGGGGGCTNGG 

CATTTTGTGTTTTAG 

Sequence 1656 

CNCTAACCCCGAAC TCTAG ATCGTCTTGCTTGTTTGTCTGAAGAAGGGAATGAAATAGAA 

AGTGGAAAAATMTATTTTCAGAGCATCTTCCCTTAAGTAAGCTACAGCAAGGCATAAAA 

TCTGGTACATACCTTCAAGGAACATTTAGAGCTAGCAGGGAAAATTACTTGGAAGCTACA 

GTATGGATTCATGGCGACAGTGAAGAAAATAAAGAGATAATCTTACAGGGACTTAAACAT 

TTAAACAGAGCTGTTCACGAAGATATTGTGGCTGTGGAGCTTCTCCCCAAGAGTCAGTGG 

GTAGCACCATCTTCTGTGGTTTTACATGATGAAGGTCAAAATGAAGAAGATGTGGAGAAA 

GAAGAAGAGACAGAACGAATGCTTAAGACTGCTGTAAGCGAGAAAATGTTGAAGCCTACA 

GGTAGGAGTTGTAGGAATAATAAAAAGGAATTGGA 

Sequence 1657 , 

CGTCCGCGGACGCGTGGGCGGACGCGTGGGCTGGCTGTATCTATACTTTCCTTGAGAAAA 

ATCCCATAAAGTGGATGGACCTGTGAAGAAAATGTATGCTTATGGCCTAGCCTTCATGTC 

TGGCTGATGTATCCTATAAGGCAGTAAGCCCCTTTTCTAGTCTCTGGTAAGATGCAAGAG 

CTCATATCCCCATCACTGACATTTTAGTTTGGAAATAATATTGAGACTGTGCTATGACCA 

ACCCCTGATGTTG7TTTTTCT1TTCAM 

GTTAAGTATTTATGTCTGGGGGGATACCTTCAGGTGTCTTATCTGTTTTATGCAAGAATT 

TATGTGTTCATCTTTATTCAGTGCAAAGA I I I I I I I I I AAATTTTGTTTATAATTGGAGG 

TAACATTAAGACAACTCTTNCTCCACAAGAAAACCTCTAAAATTAATATTCCTTAAGATT 

TGGTTTTCCTTTGCCTTATMTATTACCTT^ 

AAAAG 

Sequence 1658 

GTCGCCCCGCGTCCGTTTGATATACCACTCTGATAACTCATATAAAAATATCATCATAAA 
MGCTTMTTTCATCCCTTTTATGTTGGTTTTAAMGGT^ 

ATTGAGMCTCTTACATAGTAGMTCCATTCTATMTACATGTGTTGACAMGCTTTAGA 

GAMGTTTCCTATTCTCTTCCATTTCCCCTGCCCAAAGTGCTGACATAGGCAGTGATGAA 

GMTCTTTACCMGATTTTCAGGGTGTACCTATGAMTTGCTTTAAATGCACTGCTGGTG 

TAAATAATTAGCAAGCAAAAGCGTTTCTGTGACTTCAGGTACCAGCTTAAAGAGCACTAG 

GGATGGGGAACGAATGCCAAATCAGACTCCACCTAGAGCACCAGGAAACAGCTTGTCCCT 

GGTAGGGAAATGGTGTTGCTGAAAG 

Sequence 1659 

CGACCNCGCGTCCGGCTGNTGACCCCATGCTGAGTGGCCNGTGGGGAGCGGCGCCCGGCA 
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GGCTCTTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAG 

TTTTTCCCMGAAGMAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAGTTGGAGAA 

GTAACTGCCCATGCCCAGAAATAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGT 

GTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCtGC 

CAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGA 

AGGCAGAACAAAGAATGCATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATC 

GGAAACCTGTGAGTCANAGTCAGAGAAACTTACCCAGCCACGTATTCCTGTTTCATGGGT 

NCTGTAGATGTTTTGAGTCAAGGAAGGTA 

Sequence 1660 

TCGACCNCGCGTCCGGTGGGTCCCTGCCGGCCGGCGGCGGGCGCAGACAGCGGCGGGCGC 

AGGACGTGCACTATGGCTCGGGGCTCGCTGCGCCGGTTGCTGCGGCTCCTCGTGCTGGGG 

CTCTGGCTGGCGTTGCTGCGCTCCGTGGCCGGGGAGCAAGCGCCAGGCACCGCCCCCTGC 

TCCCGCGGCAGCTCCTGGAGCGCGGACCTGGACAAGTGCATGGACTGCGCGTCTTGCAGG 

GCGCGACCGCACAGCGACTTCTGCCTGGGCTGAGCTGCAGCACCTCCTGCCCCCTTCCGG 

CTGCTTTGGCCCATCCTTGGGGGCGCTCTGAGCCTGACCTTCGTGCTGGGGCTGCTTTCT 

GGCTTTTTGGTCTGGAGACGATGCCGCAGGAGAGAAGAAGTTCACCACCCCCATAGAGGA 

GACCGGCGGANAGGGCTGC 

Sequence 1661 

GGTGTCGACCNCGCGTCCGGCGCCCCGCTCGCATTGTTCGGGCGACTCTCGGAGCGCGCA 

CAGTCGGCTCGCAGCGCGGCACTACAGCGGCCCCGGCCCGGCCCCCGCCCGGCCCCGGCG 

CAGGCAGTTCAGATTAAAGAAGCTAATTGATCAAGAAATCAAGTCTCAGGAGGAGAAGGA 

GCAAGAAAAGGAGAAAAGGGTCACCACCCTGAAAGAGGAGCTGACCAAGCTGAAGTCTTT 

TGCTTTGATGGTGGTGGATGAACAGCAAAGGCTGACGGCACAGCTCACCCTTCAAAGACA 

GAAAATCCAAGAGCTGACCACAAATGCAAAGGAAACACATACCAAACTAGCCCTTGCTGA 

AGCCAGAGTTCAGGAGGAAGAGCAGAAGGCAACCAGACTAGAGAAGGAACTGNNAACGCA 

GACCACAAAAGTTTCACCAAGACCAAGACACAATTATGGCGAA 

Sequence 1662 

GACCACGCGTCCGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATT 

TTCCTGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGT 

CCTTAGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATC 

CCCGGATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTNCTGA 

CCTACGTGTACTTCGTTCTCTCACTTGGGCCTCGNATCATGGCTAATCGGAAGCCCTT 

Sequence 1663 

GTCGACCACGCGTCCGGGCTCCATCCGGGCTATCCTGCCGCCTTAGCGGCTGCTTCTCCC 

CAGGATGCGGGCAGGGGGCCTCTCTCCCACTCCCCACACACCGATTTCTGAGTAGCGATA 

GGGGCTGGAGGCTTATTTTATGGGGTAGGGGGCCGCTGGTAGGCGAAGATTGTCCGAGGG 

AGAGGGGGAGGATGAAGCCAGTGCGTGGCGGAGACTTGCCAGATGTTGATGCCTAAGAAG 

MCCGGATTGCCATTTATGMCTCCTTTTTAAGGAGGGAGTCATGGTGGCCAAGAAGGAT 

GTCCACATGCCTAAGCACCCGGAGCTGGCAGACAAGAATGTGCCCAACCTTCATGTCATG 

AAGGCCATGCAGTCTCTCAAGTCCCGAGGCTACGTGAAGGAACAGTTTGCCTGGAGACAT 

TTCTACTGGTACCTTACCAATGAGGGTATCCAGTATCTCCGTGATTACCTTCATCTG 

Sequence 1664 

CCGCGTCCGGGGGTTGGTACCCGAGCGCCTTCCCCTCACCTCAACCAGAGAAGAGCATCC 

GGTTGCTTTTTAAAGCTTTTAGCCTGCCCTAGCMGGACAMGCATGTTAGATTAGAGAT 

GCTTCTGCTGATCGCAGGGGTTCTTATTTGAAAACATCTATGATGGGGGTGGGGTGGGAG 

GAGACAGGTTGTGGTTATGCAGGAAAATCTTGTCCTAAAAATATATGAGTTTGGGGGTAA 

GG GGTG GGATAGCCMGCAAMTCAGTMTTATTTTAAMTGAACATATGTATTTTTATT 

AACTTTTAGTTAAATACAGATTTTACAACGAGGTCAGCATAAGCCTAAATCTATATAGAG 

GGCTAACTCAGGCATTGTCTTGTTTATTTGTAGACTGGATTAAAAACAACCTGTCCTGTT 

TTGTNAGTTCCCAGCTTCTTTCGTTTAGAATAAATTAGACCAAAAGAA 

Sequence 1665 
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GCGTNCGCTTAATGTCAATGTGGCCTGGGCTGGAGGTCTGGACCCCCCCATGGGGGATCC 

TGAGTACCTGGCTGCTTTCAGGATAGTCGTGATGCCCATCGCCCGAGAGTTCTCTCCAGA 

CCTAGTCCTGGTGTCTGCTGGATTTGATGCTGCTGAGGGTCACCCGGCCCCACTGGGTGG 

CTACCATGTTTCTGCCAAATGTTTTGGATACATGACGCAGCAACTGATGAACCTGGCAGG 

AGGCGCAGTGGTGCTGGCCTTGGAGGGTGGCCATGACCTCACAGCCATCTGTGACGCCTC 

TGAGGCCTGTGTGGCTGCTCTTCTGGGTAACAGGGTGGATCCCCTTTCAGAAGAAGGCTG 

GAAACAGAAACCCAACCTCAATGCCATCCGCTCTCTGGAGGCCGTGATCCGGGTGCACAG 

TAAGTGTGGAGATGGGACACTCGCTGAGCTCAGACTGAAGGATCTTGGT 

Sequence 1666 

CGACCNCGCGTCCGGTGTGATGATCGCTACTGCTGGAGACCGCACAGAGGAGTTCCACGG 

CCACNGCAGTGAACCTCCTGGGGAACTTGCCCCTCAAGTGTCTGGATGTTCTCCTCACCC 

TGGAGCCACATGGAGACTCCACGGAGTTCATGGGAGTGAATATGGATGTGATTCGTGCCC 

TCCTCATCTTCCTAGAGAAGCGTTTGCACAAGACACACAGGCTGAAGGAGAGTGTAGCTC 

CCGTGCTGAGCGTGCTGACTGAATGTGCCCGGATGCACCGCCCAGCCAGGAAGTTCCTGA 

AGGCCCAGGTGCTGCCCCCTCTGCGGGATGTGAGGACACGGCCTGAGGTTGGGGGAGATG 

CTGCGGAACAAGCTTGTCCGCCTCATGACACACCTGGACACAAGATGTGAAGAGGGTGGC 

TGCCGAGTTCTTGTTTG 

Sequence 1667 

NCGCGTCCGACACTATTTAGAGAGCTCCCTTCCCACCTCTCTGCCCAGCCTTGTTACCTC 

ACTTCTGCTCTGGCCATGGCTGTGAAGGGCCCAGCCAGCTCCCTGTTTTGATGTTCTGTG 

CAACAGCTCCGGGGTCTTGTGACTGGAGATCCTCAACAGGCCCTGGAGCCAGGACTGGAG 

TCTTGGCAGCTGATGAGCAGCACCTTGCCGGCCAGGAGGAGCTGATGCTGACGATCTCCC 

CAACATCTGAAGGCTTAAAGMCATTGTCGTTCTTCAGCCCTCCTTGCTTCTCTCAATAC. 

AATAAGACATTGCAGAAGCAAAAGGGTGGCCTCTGCTCCAGGCAAGGCAGCTGGCTCTGT 

CTGGGGGCGTCGGCCTGGGGCTTGGGTGCCACGTGCTGAGATTGCATAGTCAAAACAAGC 

CATTTTTGCCMCAATAGCTTGTGGCTCCCACATTTTTCTACCCTTGCACTNAANGGCCA 

GACCACTCTNTGCATGGACCAANACCATNTTTCCAAACCCATGGGGC I Mill I NCC 

Sequence 1668 

CANGAATACTGAAAAATGAAGCCTAAAATGAAGTATTCAACCAACAAAATTTCCACAGCA 

AAGTGGAAGAACACAGCAAGCAAAGCCTTGTGTTTCAAGCTGGGAAAATCCCAACAGAAG 

GCCAMGAAGTTTGCCCCATGTACTTTATGAAGCTCCGCTCTGGCCTTATGATAAAAAAG 

GAGGCCTGTTACTTTAGGAGAGAAACCACCAAAAGGCCTTCACTGAAAACAGGTAGAAAG 

CACAAAAGACATCTGGTACTCGCTGCCTGTCAACAGCAGTCTACTGTGGAGTGCTTTGCC 

TTTGGTATATCAAGGGGTCCAGAAATATACTAGAGCACTTCATGATTCAAGTATCACAGG 

MTTTCACCTATTACAGAGTATCTTGCTTCTCTAAGCACATACAATGGATCAATCCATTA 

CTTTTGCTTTGGAGGATGGAAAGTTATGAGATATATGTTGAAGACTTGAAAAAAG 

Sequence 1669 

GTCGACCNCGCGTCCGCCCGCCCATCACTGCTGTTCCTCCAGGGCCAGCACTCGGGCGAG 

GCAGGGGAGCTGCCTTCGGTACATAATTTGAAGGGGCACTCCCTCTTGGGCACATGCCGG 

CCCTGAGTGCCTCCCTTGCCTCACTCTGATCCTGGCCCCATAATGTCCTCAGTGGAAGGT 

GATGGGGGCCGGTGCTGTGGGGAGAGTAGAAAGAGGGGTTGGCATGACTAAAAATACCAG 

TATGTGTATTAAGTATTTTGAGAATGAAATGCCAAGGAGTGCCTACTATATGCCAGCTCT 

AGGAATGGAGTAGACAGTGGACACAAGAAGGACTTACGCCCTGAGCACAGGTGCCAATGG 

TGACAAGACTGGCAAGACGTGAGGGCATGAATGGTTCATTCAGGCAGCTGCTGCAGATGT 

GGTCACCTGGTGCCATCTGCTGCTCCCTTTTCCACTTTTCTATGTCCTCCTTCCACCCCA 

A 

Sequence 1670 

CGACCNCGCGTCCGGTCTGAAGGGTCTGGCTGGTGAGCCAGGTTTTAAAGGCAGCCGAGG 
GGACCCTGGGCCCCCAGGACCACCTCCTGTCATCCTGCCAGGAATGAAAGACATTAAAGG 
AGAGAAAGGAGATGAAGGGCCTATGGGGCTGAAAGGATACCTGGGCGCAAAAGGTATCCA 
AGGAATGCCAGGCATCCCAGGGCTGTCAGGAATCCCTGGGCTGCCTGGGAGGCCCGGCCA 
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CATCAAAGGAGTCAAGGGAGACATCGGAGTCCCCGGCATCCCCGGTTTGCCAGGATTCCC 

TGGGGTGGCTGGCCCCCCTGGAATTACGGGATTCCCAGGATTCATAGGAAGCCGGGGTGA 

CAAAGGTGCCCCAGGGAGAGCAGGCCTGTATGGCGAGATTGGCNCGACTGGTGATTTCGG 

TGACATCGGGGACACTATAAATTTACCAGGAAGACCAGGCCTGAAGGGGGAGCGGNGCAC 

CACTGGAATACCAGGTCTGAAGGGATTCTTTGGAGAGAAG 

Sequence 1671 

GAGTCGACCNCGCGTCCGCAACTGGTGTCCAGCTCGGTGCACTCCAAGCGCCGTTCCCGA 

GCGGACCTCACGGCCGAGATGATCAGCGCCCCGCTGGGCGACTTCCGCCACACCATGCAC 

GTTGGCCGGGCCGGAGACGCCTTTGGGGACACCTCCTTCCTCAATAGCAAGGCTGGCGAG 

CCCGACGGCGAGTCCTTGGACGAACAGCCCTCTTCTTCATCTTCCAAACGCAGTCTCCTG 

TCCAGGAAGTTCCGGGGCAGCAAGCGGTCACAGTCGGTGACCAGGGGGGAGCGGGAGCAG 

CGTGACATGCTGGGCTCCCTGCGGGACTCGGCCCTGTTTGTCAAGAATGCCATGTCCCTG 

CCCCAGCTCAATGAGAAGGAGGCCGCGGAGAAGGGCACCAGTAAGCTGCCCAAGAGCCTG 

TCATCCAGCCCCGTGAAGAAGGCCAATGACGGGGGAGGGCGGCGATGAGGAGGCGGGCAC 

GGAGGAAGGCAGTGCCCCGTCGGAAT 

Sequence 1672 

CGCGTCCGCTCGCGGCNGGGCATCGNGTACATCCTCAGCAACCATGGGCTACGTGCGCCA 

GCTCTCCCAGGCCCTGGACACATCCAACGTGATGGTGAAGAAGCAGGTGTTTGAGCTACT 

GGCTGCCCTGTGCATCTACTCTCCCGAGGGCCACGTGCTGACCCTGGACGCCCTGGACCA 

CTACAAGACGGTGTGCAGCCAGCAGTACCGCTTCAGCATTGTCATGAACGAGCTCTCCGG 

CAGCGACAACGTGCCCTACGTGGTCACCCTGCTTAGCGTGATCAACGCCGTCATCTTGGG 

CCCCGAGGACCTGCGCGCGCGCACCCAGCTGCGGAACGAGTTTATCGGGCTGCAGCTGCT 

GGACGTCCTGGCTCGCCTGCGGTGAGTCCCCACTGTAGCGGTCCTGCCGGNTTNCCCCTC 

CTGCTCCCAAGGCCAGGCCCACCTGCCCTTTGGCTCCCAGCCACCTCACCTAAGCAGCAC 

CTTCCAGATGGCAGGGGAGGTGGC 

Sequence 1673 

GTCGACCACGCGTCCGGCCAGAGCTGAGTGGCAGCCGCCTCCCTTATGCAGGACATGTGC 

TCTCGGCTTCACCAGGGTTCTGACCGGGTCTGCTTCTGCATTCACAGCGCCTCCTGGACC 

TGAAGGCATCTGAGTGTGAGACCCTGTTCTAACTCTTAGAAGTGACATTGTAAGAGGTGG 

TGGGGACCAGCTAATTGGTCCAACCCAGCCTGAGTGCACCACCCTTTGAACAAATGTATC 

AGTGATGAAAATTTGCCTTTGCCCCGGCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGTGGGCGGATCACTTGAGGTCGGGAGTTCAGGACCAGCCTGGCCAGCGTGGCGAAACCC 

CGTCTCTACTAAACATAAAAAAATTAGTCAGGTGTGGCGGTGCGTGCCTGTGGTCCCAGC 

TATTCAGGAGGCTGAGGCACCAGAATTGCTTGA 

Sequence 1674 

TGACGGCGGCCCGGCCGACGGGAGCCGGGGCGGGGCGGCGGNCCANCGAAGGAGCGCGCG 

GGCGGTCTGGCCCCGCCCCCTCCCCGCCCGCCTTCCCGGTGACCTTCAGGGGCCCGGGTG 

GCGGGCGCAGGCCCCTGCGGCGGCGGCGGGATGTTCGTGCAGGAGGAGAAGATCTTCGCG 

GGCAAGGTGCTGCGGCTGCACATTTGCGCGTNCGACGGCGCCTAGTGGCTGGAGGAGGCC 

ACCCNGGACACCTACNGTGGANAAANCTCAAGGAGCGCTTGCCTCAAAGCACTGTGCTCA 

TGGGGAGCTTANAAGATCCCCAAAAGTATAACCCATCATTAAATTTAATCCCACGCTGCC 

TNAANANAAGGGGTGCTTGNGTGATTGCCATGNACCATNCTTGGGAAGGMGAAACCATT 

CCCAGGACCCAAAAGATGGGCCCTATTTNTTGGATTA 

Sequence 1675 

CACGCGTCCGGGATCCCGTACCCGGGACAGACTCGGCGCCGCTGGCTGGCCTGGCCTGGT 

CGTCGGCCTCTGCACCCCCGCCGCGGGGGTTCAGCGCGATCTCCTGCACCGTCGAGGGGG 

CACCCGCCAGCTTTGGCAAGAGCTTCGCGCAGAAATCTGGCTACTTCCTGTGCCTTAGTT . - 

CTCTGGGCAGCCTAGAGAACCCGCAGGAGAACGTGGTGGCCGATATCCAGATCGTGGTGG 

ACAAGAGCCCCCTGCCGCTGGGCTTCTCCCCCGTCTGCGACCCCATGGATTCCAAGGCCT 

CTGTGTCCAAGAAGAAACGCATGTGTGTGAAGCTGTTGCCCCTGGGAGCCACGGACACGG 

CTGTGTTTGATGTCCGGCTGAGTGGGAAGACCAAGACAGTGCCTGGATACCTTCGAATAG 
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GGGACATGGGCGGCTTTGCCATTCTGGTGCAAGAAGGCCA 
Sequence 1676 

TCCCTCTGCTGATGATGGATGCCCCTAACACCTGTGCCTAACACCCCTACTGAACCCCAC 

AGCTCCAGCCTTAGTTTTTGGAGTCAAGTGTTAAAGGTTTCTGGCCAGAGGAATTGGGGT 

CTTGCCATCCCTGCAATAGCCCTTTTATGGGCTCTGGGAGACAGCTTTAGGGAATAAATG 

GGGATTTTCCCCTTTTTCTACCCACTCCTTTGCTTCCTCCMGACTTACCCMCTCCTTC 

CCCCTCAGAGAACCAAATAGCCTGAGGMGCAGGAGAGTTCCTGGTTATGGCAGATTCTT 

GGTGATTTGGGGCTTCAAGACAGTAGGTGAGAGATGCTGTCAGGGACGTATCTTCTTCAT 

ACCAMGTCACTGGTCCTTTCTCAGCCTCTCTCGTGCTTTTCTCCTMTGACCATATTTT 

TGCCAAAAATTGGGAATATGTTATCTGACAGACCAGAATATTTGAAGGTTTGGGCTG 

Sequence 1677 

GCGCCGCGGATATNCGGATCAACCTATGGTNTCAATATTGTNAGTTATTCAGCATAAACA 

GMTTATTTCCCAANACTTGATCTGAMTATTN^MTGGTCNTACTNGAAACTTATATT 

CTTNCTGGGAGNGANGTNTTTATCATTTTTCCATGGAGACAGGTTCTAACTCTGTTGCCC 

AGGCTGCANTGCAGTGATGTGATCATAGCTCACTGCAGCCTGAAACTCCTGGGTGTCAAG 

TGATCTCTGGCCTCAGCCTCCCAAGTAGTTGGAACTTCAGATACGTGCCACCACAACCAG 

CTAATTTATTTTTTAGAGATGAGGTNTCGCTATGTNGCCCAGTCTGGCCNNCTAGCCNCA 

AGTGATCTGGCCATCTNAGCCTTCAGTTGGAGATGTCTGATTTATGTTAATATAAGAAAG 

CTGTTGATCGTTTATCATAAANGCATTT 

Sequence 1678 

GTCNCCNCGCGTCCGCTCCTCCGCCGGCATGCAACTCGGCGCCCGCGGTCCATGGACCGG 

AACCTCGGGCCGACGGACGGGAACCCGGGCCGCGATCGCCGCCTCCCCGCCTCAGGCTCC 

TCCTCCTCGCTCTCCGCCGCCTCCGCCGGACTCCCGCAGGCCCTGCACCGCCGCCGCCAG 

GCTAGCGGAGCTGCCCCGGGAAGCTGGGTGACGGGTTCGCGGCTGCCGCCGGACTGCGGC 

CTACTCCGCCGCCTCTCAGTGCTATTGTCCCTGGGCCTGGCCTTGAGCGGGTCCACTGGG 

GAAGGCNCGTGTGCGCCGGCTCCGCGGAAGATGCCGGACCAAGCCCTACAGCAGATGCTG 

GACMGAAGTTGCTGGGTTTGTTTTGCTACTGATGAAGATGATAGAACAGCTGAATGGGT 

GAGGACCATGGCAGGTGCAGGAGGATCTACAAAATGGGTTCACCAGGGCCTGTCTACAAC 

GCTGGGTGGATGAAAAGCAAAGAG 

Sequence 1679 

GCGTCCGGGCCCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGC 

CGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGTGTGGCT 

GCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTG 

CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAA 

TMCTACAGGCTTTTCCCMGACTACMAMCTTCTTGAMGTGACTACTTTAGGTATTA 

CAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAA 

GGGGACTGTGCTGTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCG 

AGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGAAGGAATGTGAACAAGCTGAACG 

ACTTGGAAGCAGTGGAT 

Sequence 1680 

GTCCGGCGTGGGGAAGGGTGGGGTGAGGGGGCGTGGCCGCAGCTAGGGCGGCGAAACTCT 

CCTCCCCTCGGCCCCACCGCGTGGGACGGCGTGAACGTGGTGTCGGAGGGATGTCAGCCT 

TCTCTGAGGCGGCGCTGGAGAAGAAGCTGTCGGAGTTGAGCAACTCGCAGCAGAGCGTGC 

AGACCTTGTCCCTGTGGCTCATTCACCACCGTAAACACTCGCGGCCCATCGTCACCGTGT 

GGGAGCGGGAGCTGCGGAAAGAGTGGAGGTGCAACAAATGAGAAATTCCAATTGGAGATT 

TTGTCAAACAGGMTTTACTCCAMCCMACAGGAAGCTTACTTTTCTCTACCTAGCCAA 

TGATGTCATACAGAACAGCAAGAGGMGGGGCCAGAGTTTACAAAAGATTTTGCACCAGT 

TATA 

Sequence 1681 

CCGGCAAAGCAGGGACTCCTGATTTATATGTCCCTCCTCCTGGCAATCCTCTCACCCCAC 
CTCCCCTGAGAACCTCAGTTCTTCCTAAATTGCTAAAGCTGAGGGGAAAGGGATGCTTTG 
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CGGCAAAGGCGCTGCTGCCTCAAGCTTGCTTTACATGCCTCTCTAGTTCCTCTCGCACTA 

CAGAGTGGTAGCGAACAAAAGCGTTCGCCCTAGAAGCGACCTGAATGGAAAAATCTGCAC 

ACGAACTAATGGGTTTGTCACGGAAAGTAGGGAACCGGGTCTGCAGCATTCCCTGGAGAC 

AGACTTTCTGGTTGGTTTTCAAGGGTCCAAGGCAGCCATCAGCCCGGCTGTGCCCTCCCA 

CCCTGCCTCCCACCCAGTTGATTCTCTCTTTGTGTAAGTTTAGCCCCTCTGAGGGTGGTG 

GAGTGAGAGCATCCCATCAGATATATATACGATTCATCAGTCGGCACTTAAAAAG 

Sequence 1682 

TCACCNCGCGTCCGAAAAACGCAGATGATATACCTGCAACATCNGTCATGGCTGCGCCCT 

GTGCTCAGAAGCAACCCGGGTGGAATATTGCTGGTGCAACAGTGGCAGGGCACAGTGCCA 

CTCAGTGCCTGTCAAAAGTTGCAGCGAGCCAAGGTGTTTCAACGGGGGCACCTGCCAGCA 

GGCCCTGTACTTCTCAGATTTCGTGTGCCAGTGCCCCGAAGGATTTGCTGGGAAGTGCTG 

TGAAATAGATACCAGGGCCACGTGCTACGAGGACCAGGGCATCAGCTACAGGGGCACGTG 

GAGCACAGCGGAGAGTGGCGCNCGAGTGCACCAACTGGAACAGCAGCCGCGTTGGCCCAG 

AAGGCCTACAGCGGGCGGAGGCCAGACGCCATCAGGCTGGGCCTGGGGAACCACAACTAC 

TGCAGAAACCCAAGATCGAGACTCAAAGCCCTGGTGCTACGTCTTTAAGGCGGGG 

Sequence 1683 

CCGTCCGCTCCTTGGCAAGAACGAAAGGTGTGATGAAACCTCCCTGCTCGGAAGGGTCTC 

CGTGGAGGTGTCCTCATTTCACATGCTGGGTTTTGCAAGCGAGGAAGCCAGGCAGTGGAG 

GAACTAGAGAGAGGCAGGCGTGTGTGTGGACAAGCGCTGGAGCCGCAGCCCTCAGACTGG 

CACGGGAACGCCAGCGTTGGGTGTTCAGATTCCACGCGTATGTCTGGGCTCACTCACAGC 

ATGGCCGAGTGTCTGCAGTGCTGGTCCTGACCCTTCCAGAGCAGCAGTGGACAGATGAGA 

TAAGACTGTTTCAGAAACAAAGATGGCCACAGCCTTCCTAACAAGCAGGTCATCTGGCCA 

TGTCTGTATTGTAACTGGTAAAAGGCTTCAAGTCAGATTGATGATCAAGAAAANGTCAAA 

ACCCCAGCCCAAGATTGGGAMGCAGGTTNGTGGNTCCAANGCTTTTTAAAAAAATTATT 

TGAAGCTCTTCATTCCTNTTTCTGTGAGTGTGTCTTTCTCTT 

Sequence 1684 

NCCCATACTGGGGGCCCCCTTCCTGCAGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTT 

CCAAGACACTGTCACTGATGTGGATAAATCTTGGAAGGAGCTCAGTAATGTCCTCTCAGG 

GATCTTCTGCGCCTCTCTCAACTTCATCGACTCCACCAACACAGTCACTCCCACTGCCTC 

CTTCAAACCCCTGGGTCTGGCCAATGACACTGACCACTACTTTCTGCGCTATGCTGTGCT 

GCCGCGGGAGGTGGTCTGCACCGAAAACCTCACCCCCTGGAAGAAGCTCTTGCCCTGTAG 

TTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGATCGCTTGTTCCACACCAGCTACCA 

CTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATGCACCGCTGTACTAGCATCTCCT 

GGGAGCTGAGGCAGACCTGGCAAGTTGTATTTGATGCCTTCATCACGGGGCAGGGAAAGA 

A 

Sequence 1685 

CCGCTGGTTATTACCCAGCTGGATGGTTTCCTTTTAGGCAAGAAGGAGGTCATCAGCAGG 

CTCCCAACAATAATGCCGAAGTTAACAATGATGGGCAAAATGCAAACAACTTGGAACTTG 

AAGAAATGGAGCGTCTTATGGATGATGGGCTTGAAGATGAGAGTGGAGAAGATGGAGGTG 

MGATGCCAGTGCMTTCAAAGGCCTGGATTMTGGCTTCAGCTTGGTCTTTCATCACCA 

CCTTCTTTACTTCACTAATACCAGAGGGGCCTCCCCAGGTTGCCAATTGACCTGAAAAAC 

TGTGCCAGCTACMGGAGGGTCTGACTTCAGGAMGTGGTTTAAATAACAGTGCAATTTC 

MAAMATTTATAACTTTCTTTTGATCATCATGTACAGAGGTG I I 1 1 I I I < CTTTAGGCT 

TCTCATG CATATGAATATTTTAAGCACGAATGGACTACTAAATATCTGAG I 1 1 I H I M I 

I I I I I I I I I AAAGAATC 

Sequence 1686 

CGCGCCGGTTTGGCTGCCCTGCATAAGCTGCTACAAATAGAATAAAGAATTTCATACGCC 
TGTATCTATCATTTAGATGCATGGAAAAAMTGGGCTTTGCACACAATGGGTTTGGAGCT 
GAC TGGGA ACAATGGAAAAAATTACATTAGCTGTGGTTGTAAAG lllllll GTTTTTGGT 
TTTGI i I I I I I I I ICf I r I I ICI I I I f I 1 1 I 1 1 I I IACCATCTTGTGAAAGGTTTCTGAA 
ACTCGATAATAAAAAGCGGTTGGTGTAMTTATTCTTTTGTGTCACA'l I I I I AG AAGGAA 
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AAACATAAAAGAATGTATCCTTAGTACTGGTTCTTAAACAGCCCATAAAAACCCATTGGC 
CTGAAGCTTATATCTCAGGCCTATGCCCATCTTATAGTCTTGGAAGACAAAA 
Sequence 1687 

CGCGTCCGTGGGTCTCGCCCTCAACCTGTGCATGTATATGTGTGTCTTTGTGTGTGTATG 

TGTGATCTCTGCCTGCAGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCC 

CGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTG 

GGCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATC 

ATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTC 

TCTGCTGTGCTGGTGGAGCCCCCGCCGATGAAGCCTGCCTGGTGCTTGGCCTGAAGTTTC 

CCAAGGAAGCTGTCTGGTGGGCCCCTTGGGTGATGGGCCCCTTGGAGGGCTTGGAAGCCC 

CCCCATGGGCCMGCCCCAGCCCTGGGCTTCTGGTTCNTCTTGTCCCTGGTCACCCCCAT 

CCCCACTCCCCCTTGGGTGGGCCTTGAACTTNCCACTTCCCTTGGGTGGGGCCC 

Sequence 1688 

AGGAGGNTTGAAGGAGTTGNNGGAGGAGGAGGATGGAGGCGAGGGCGAGCGAGCCCAGCG 

GGGTCCNGGNCGCCCCGCGGGCCAAAGTCGAGCCCTNCCGCCNNTGGGCGAGCGCGCCAG 

CCGCCCNTTTCANAACAGTTCGNCGCCACAAAANAAAAGAACGGGGGGGTGCCGAGGTTN 

CCCATTGANCTCTTAAAGTGGTGCAGGTCCCTGTTGAGTGCGCTGCACCGGGCCGTGACC 

CGCGCCCCTGTGCGTCCC 

Sequence 1689 

GGAGTCGACCACGCGTCCGCGCCGGCCGGCGGTGTCCGGACCGCTCGCCCCCGTTTGGAC 

CCGACTTCGGTTCTTCTGGGGTGTTGATGCTCCTAAAGCCCGAGAGCACGTGTCCAGACC 

CTAGCCTGTACGACGCTGACTCTGCCCGGTCCCAGAACCAAAGCCATGCCGGGGTGTGGC 

CTCTGACCCCACGCGGAGGGGACCTCGCCTTGCGGGACCCCACCTGGAACCCGACCTNCC 

AGNCTCGCAGCCGGCCTGAGCCGCCATGCGCGGGAAGTTGCTGCCGCTGGCCGGCCTATA 

CCTGGTGCAGGGCCTGCCCTACGGGCTCCAGTCCGGCCTCCTGCCAAATGCTGCTTGCGT 

GCCGGCGGCCTCTCGCTGACGCGCGTGGGGCTGGCCAAGGTTCTGTACGCTNCGTGGCTT 

GCTTCAAGCTGGCT 

Sequence 1690 

CNCCCCGCGTCCGCGGACGCGTGGGTGCTTGCTGCTGAACCTGAGCTGCAAGTTGGAATT 

GATATAATGAAGACTAGTTTTCCAGGTCGTGGTTCMTTCCAGMTTCTTTCATATTATG 

AAAAGAAAGTTTACCAACAAAGAATGGGAAACAATCAGAAGCTTTAAGGATGAGTGGACT 

CAGCTGGATATGTTTTATAGGMTTGGGCACTTAAGGAAAGCTTCATAAAAGCCATTGGT 

GTTGGACTAGGATTTGAATTGCAGCGGCTTGMTTTGATCTATCTCCATTAAACTTGGAT 

ATAGGCCAAGTTTATAAAGAAACACGTTTATTCCTGGATGGAGAGGAAGAAAAAGAATGG 

GCATTTGAGGAMGCAAAATAGATGAGCACCATTTTGTTGCAGTTGCTCTTAGGAAACCC 

GATGGGATCTAGACATCAGGGGATGTTCCATCTCAGGATGATTCCAAACCAACCCAGAGG 

GCMTTTACTATTCTCAACTTTAATGATTTAA 

Sequence 1691 

GACCACGCGTCCGCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCC 

GGGCCGCTGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCGGGGCGCTGTGTCCCC 

AAGCTCCCCGTCCTCGGCCAGGCGGGCACCACGGCAGGGGCTGAGCTACCCTCATGGAAG 

GGAGAGGACCGTACCGGATCTACGACCCTGGGGGCAGCGTGCCCTCAGGAGAGGCATCCG 

CAGCTTTTGAGCGCCTAGTGAAGGAGAATTCCCGGCTGAAGGAAAAAATGCAAGGGATAA 

AGATGTTAGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAAGG 

CAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTCTCCCTCCTTGGGCTCCT 

TCGACCCCCTGGCTGAGCTCACAGGAAAGGACTCAAATGTCACAGCATCTTCCACAGCCC 

C 

Sequence 1692 

ACAGMTTTAGGGGTGGGTGAAAGCACTTGNGCTTTAGCTNTTTCATATTAAATATATAT 

CTATATTTAAACATTCATGGCATAGATGATGATTTACAGACAATTTAAAAGTTCAAGTCT 

GTACTGTTACAGTTTGAGMTTTGTAGTATTACATCATTACATAAGTCATTTTAGTAACA 
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GCCTTTGTGAAATGMCTTGTTTACTATTGGAGATAACCACACTTAATNAAGAAGAGACA 

GTGAMGTACCATCATMTTMCCTAMTTTTTTGTTATAGCAGAGTTTTCTTGTTTAM 

AAMAATAAAATCATCTNGAAMGCATTTTGTACAGTTAAMTGTATMTGMGCTTTTG 

CCAACCAGACTGGTGCTTAGCAACCAAA I I I I I I I I I I AAATAAAGCTTTTATGGCAGGT 

GGGTAAATAAGGTGGCCTTCCAAATATATTGGTGTCTTGATGGAGAGTTNT7TAGTTGAA 

ATGAATGTGGGTCTTTTCT 

Sequence 1693 

CGGTTAACATGGCCGTCACCGACAGCCTCAGCCGGGCTGCGACTGTCTTGGCAACTGTNT 

TGCTCTTGTCCTTCGGCAGCGTGGCCGCTAGTCATATCGAGGATCAAGCANAACAATTCT 

TTATGAAGTGGCCNATCAAACAANCTGGGCCTGTTCTTGGTGTGTACATCCX)CGATTCTG 

GTATTAATTATCGACATGTTGCAAATACCCTTTCTGTTTATAGAAGTGTCAAGAGGCTAG 

GTATTCCTGACAGTCACATNTGCCCCTAATGCTTGCAGATGATATGGCCTGTAATCCCTA 

GAMTCCCAAACCAGCTACAGTGTTTTAGTTCACAANCAATNTGGAACTAAATGGTGTAT 

GG GAGAATG ATGTGGGAAGGTGNNATTATAGAAGTTTNTTGAGGTAAACNGGTGGGAGAA 

NNNTTTTTTTACCNGGGTMTTTAANCTGNGGGAGGGANTCCCCACCCTAAGTANCTTCC 

TTCGGG 

Sequence 1694 

GTCCG CAAGATGGACGCAGCTCTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAG 

ATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAG 

AAAAGGACGAGATCTTGTCTGATGTGGCATCTANACTTTTGNTTTACATACAGGAAAAAC 

TTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGG 

TGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGG 

TGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCXaTCCTCAACGCATTCATCGAC 

AGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAG 

TCCATAGGGCCAGGTGGTACGG 

Sequence 1695 

CCCCGCGTCCGCTCGNAGCTGTCCGCGGTCTGTTTGGCCCGAACGGCGGCGGAGGCGCTG 

ATCATGGCGACATTCATCTCGGTGCAGCTGAAAAAGACCTCAGAGGTGGACCTGGCCAAG 

CCGCTGGTGAAGTTCATCCAGCAGACTTACCCAAGCGGCGGGGGAAGAGCAGGCCCAGTA 

CTGCCGCGCGGCGGAGGAGCTCAGCAAGCTGCGCCGCGCCGCAGTCGGTCGTCCGCTGGA 

CMGCACGAGGGCGCGCTCGAGACGCTCCTGAGATATTATGATCAGATTTGTTCTATTGA 

ACCCAAATTCCCATTTTCTGAAAATCAGATCTGCTTGACATTTACCTGGMGGATGCTTT 

CGATAMGGTTCACTTTTTGGAGGCTCTGTAAAACTGGCTC7TGCAAGCTTAGGATATGA 

AMGAGCTGTGTTGTTGTTCAATTGTGCAGCCTTAGCTAGCCAAATTGCAGCAGAACAAG 

AACCTGGATAATGATGAAGGGATTGAAAATCGCT 

Sequence 1696 

TTCGGGAGTCGACCCCGCGTCCGGGCCAGCCGGCTCGCCCGGGGGCCATGGCAGCAGCGG 

CTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCCTCCCGGGGAAGTCATTCATCTGAATG 

TGGGAGGCAAGAGATTCAGTACCTCTCGCCAGACTCTCACCTGGATCCCAGACTCCTTCT 

TCTCCAGTCTTCTGAGCGGACGCATCTCGACGCTGAAAGATGAGACCGGAGCAATCTTCA 

TCGACAGGGACCCTACAGTCTTCGCCCCCATCCTCAACTTCCTGCGCACCAAAGAGTTGG 

ATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCATGAAGCCCAGTTCTATGGGCTCACTC 

CTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTTGGATTCGATCTTCTTGTGGAAACGTC 

CTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCCAGTGAAGCGGCGGAACCGGCACAGC 

CTAGTGGGGCCTCA 

Sequence 1697 

CGTCCGMGGMGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATTTTCC 

TGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGTCCTT 

AGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATCCCCG 

GATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTCCTGACCTA 

CGTGTACTTCGTTCTCTCACTTGGGCCTCGCATCATGGCTAATCGGAAGCCCTTCCAGCT 
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CCGTGGCTTCATGATTGTCTACAACTTCTCACTGGTGGCACTCTCCCTCTACATTGTCTA 
TGAGTTCCTGATGTCGGGCTGGCTGAGCACCTATACCTGGCGCTGTGACCCTGTGGACTA 
TTCCAACAGCCCTGAGGCACTTAGGATGGTTCGGGTGGCCTGGCTCTTCCTCTTCTCCAA 
GTTC 

Sequence 1698 

CGCGTCCGGCCGCGCCCATGGCCCGCCGCTGCCCGGCCGCGCGGGCGGGCCCGCCACGCC 

GCTGTCGCCCACGCGCCTGTCGCGGCTGCAGGAGAAGGAGGAGCTGCGCGAGCTCAACGA 

CCGCCTGGCGCACTACATCGACCGCGTCCGCGCGCTGGAGCTGGAGAACGACCGGCTCCT 

GCTCAAGATCTCAGAGAAGGAGGAGGTGACCACGCGCGAGGTGAGTGGCATCAAGGCGCT 

GTACGAGTCGGAGCTGGCCGATGCCCGGAGAGTCCTGGATGAGACGGCTCGAGAGCGTGC 

CCGGCTGCAGATAGAGATTGGGAAGCTGAGGGCAGAGTTGGACGAGGTCAACAAGAGCGC 

CAAGAAGAGGGAGGGCGAGCTTACGGTGGCCCAGGGCCGTGTGAAGGACCTGGAGTCCCT 

GTTCCACCGGAGCCGAGGTGGAGCTGGCAGCTGCCCTCAGCGACAAGCGCGGCCTGGAGA 

Sequence 1699 

ACGCGTCCGGAAGAATCTACACTTCTTTGCACCAGAGTATGGAGAAGTCACTAATGTGAC 

AACAGCAGTGGACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGAT 

TCAGGGCAATGGAGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCT 

TCTAGAAGACCCATTACAGAGGGAGTTCATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCG 

CAGACCAACAGCCAGAGMCTTCTGTTCCACCCAGCATTGTTTGAAGTGCCCTCGCTCAA 

ACTCCTTGCGGCCCACTGCATTGTGGGACACCAACACATGATCCCAGAGAACGCTCTAGA 

GGAGATCACCAAAAACATGGATACTAGTGCCGTACTGGCTGAAATCCCCCAGGCCCTGAT 

CTGCGCTGTGGCTGTCCCTGGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTC 

Sequence 1700 

GGGAGTCGCCCCGCGTCCGGATTTCAGTTGGTGGCGTCATAGTCTCATTACAGTGTCTAT 

CTTGGCATTTACCMTTTACTTGTCATCTTTGTCCCAACTATTAGGGATATCTTTGGTTT 

TATTGGTGCATCTGCAGCTTCTATGTTGATTTTTATTCTTCCTTCTGCCTTCT 

GTTGGTGMGAAAGAACCTATGAMTCTGTACAAMGATTGGGGCTTTGTTCTTCCTGTT 

AAGTGGTGTACTGGTGATGACCGGMGCATGGCCTTGATTGTTTTGGATTGGGTACACAA 

TGCACCTGGAQGTGGCCATTAATTGGCACCACTCAAACTCAAACTCAGTCCATCTGATGC 

CAGTGTTGAGTAAACTCMCTACTATGAAATTTCACCTAATGTTTTCAGTTTCACTTCCT 

TTTGAAGTGCAGATTCCTCGCTGGTTCTTCTGAGTGCAGAATAAGTGAACI i I I I IGTTT 

TGGT7TGN I MM IAAGAAAC 

Sequence 1701 

CCCACCGTCCGCGCGCGGCGCCTCGCCTCGGCCGGCGCCTAGCAGCCGACTTAGAACTGG 

TGCGGACCAGGGGAATCCGACTGATAAATTAAAACAAAGCATCGCGAAGGCCCGCGGCGG 

GTGNTGACGCGANTGCGATNTNCTGCCCANNGCNTCTTGAATGTCAAAGTTGAANAAANC 

CAATGAAGCGCGGGTAAACGGCGGGAAGTAACTAATGACTTCTCATTAAGGGTAGCCAAA 

NGCCCTTCGTCATCTNAATTAAGTTGGACCGCGCANTGAAATNGGATGAAACCNAGANTT 

CCCACNTGTCCCTACCTACNTAATCCAAGGCGGAAAACCACAAGCCAAAGGG 

Sequence 1702 

CGACCACGCGTCCGGACAGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGGTGCATGG 

CCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGACTCTGGC 

ATGCTAACTAGTTACGCGACCCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCC 

TCTTCCTGCACGCCGGGATCTTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCC 

TGCGGGACCTGGAGAAGCTGGCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTG 

GTGTCACCGGCAACCTGGCCAGTGCCATCTTCCTGCCATACCGAGCAGAGGTGGGTCCTG 

CTGGCTCCCAGTTCGGCATCCTGGCCTGCCTCTTCGTGGAGCTCTTCCAGAGCTGGCAGA 

TCCTGGCGCGGCCCTTGGCGTGCCTTCTTCAAGCTGCTGGCTTGTGGTGCTCTTCCTCTT 

CACCTTTGGGCTGCTGCCGTGGATTGACAACTTTGC 

Sequence 1703 
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(3ATCGACTTCGCCTGACGGAATCCAGGGGTCGTAATATATGTAAACTCGCGTCCGNGCTG 

CGTGCCCAGAGTAGTGGCCGAATACCTTAACGGGGCTGTGCGCGAGGAGAGCATCCACTG 

CAAGTCGGTCGAGGAGATCTTCGACGCTGGTGCAGAANCTGGCCGACCAGTCGGGCTTGG 

ACGTGATCCGCATTCGCAAGCCCTTCCACACANGACAACCACTAGCATTCAGGGCCAGTA 

GGCACCCCTTCACCAACAAGCTTGACCACGTGTCCCGCGGAACTANCGCACCCCCGAGNA 

GGGTTCAANGNATTCCTTGCACCCAGCCCAGATGCCCTGGCTTNTGGGGGGNCAAGTGAC 

CCTTTGTGNAACCCACTCATTTTTTATGGCMGGTGAGCATTNCCTAAAAACCCTTGAAA 

MTGANGGGMNAACTTTCAAGGGGTTTTTTACAGGGCCCTTGGI MUM I AAAATCCC 

CAANATTTGGATAATAAATGGAATCCTCAAAAACACAAGTGGAGGAAGGNTCTTGAAAGG 

GC 

Sequence 1704 

TCGACCACGCGTCCGGCCGGAGAAACTTGAGCCGGCTGCCX^CGCCCACGGTGCCCGAAGC 

CCCAAAGGCTGGAATTAGGGGCTAGAAGTCTGGCACCCACCGCCTGGCCAGGTGTTCGGG 

ACGCGACCAGGTGGGCGGTCGCCCCGCCCCGGGAGCGCGGCTTAATAGCTGAGAGCCCGG 

GGGCCAGGCCGNGGCTGCGGCCCAGGCAACGCCCTGAGGGTGGCCACGCTGNCAGGTGTT 

CCACTCCCCCGGGACTATGGGCAAGGGCCCGGGGCGGGGAGGGCGGCAGGTGCTGACACT 

GGAGCTGCGCCGGAGGTCGGGGAACTCGGCCTCCTAAGACTGAGGACACTCGCCTGCTGG 

GCCGGNCGAGCTGTGCGGTGCCCTCCGGGACGCAGGGGGCGCTGCAGCCACGCTGGGTCA 

GGCTCCGAAGGGCCCTCCCAACCCGGGGA 

Sequence 1705 

CGCCACGCGTCCGGAAAGATGGAGGTGTGGGGACAGGAGCTGGGTGTGCTGGGGACTGGC 

CGCGGACCCCTAACCTGTGTCTCCGGTCTCCCTCCGGGAGCGGCTCAACCCAGCCCATCG 

CTCTGGCCCCGTTCTGGCCCTGCAGGGTGGTGGTTGGGACGTTGAAATGAGCGCGCGAGT 

GGTACGTCCTCTCTCCGCGCTCACGCCCCCCTCCTCACCGTGTTTCCCGCCAGGACCATC 

AGCACGTGCCCATCGACATCCAGACCAGCAAGCTGCTCGATTGGCTGGTGGACAGAAGGC 

ACTGCAGCCTGAAATGGCAGAGTCTGGTGCTGACGATCCGCGAGAAGATCAATGCTGCCA 

TCCAGGACATGCCAGAGAGCGAAGAGATCGCCCAGCTGCTGTCTGGGTCCTACATTCACT 

ACTTTCACTGCCTAAGAATCCTGGACCTTCTCAAAGGCACAGAGGCCTTCCACGAAGAAT 

ATTTTTGGC 

Sequence 1706 

TCGCCNCGCGTCCGCTGAAGCAAGAGAATCACTTGAACCCAGGAGGTGGAGiSTTGCGTGA 

GCTAAGATCGCGCCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCGTCTTAAAAA 

AGCCCATGGCAGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCA 

AGGTGGGCGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCGAACACGGTGAAACCCC 

GTCTCTACTAAAAAAAATACAAAAAAATTAGCCAGGCGTGGTCGTGGGCACCTGTAGTCC 

CAGCTACTCAGGGGGCTGAGGCAGGAGAATGGCGTGAATCCGGGAGGCGGAGCTTGCAGG 

GAGCCGAGATAGTGTCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAA 

MCAMMGCCCGTGGCMTTAATGGTAAAGGAAACCCGGCTTTTAGTGTAAAGAGGTAA 

CATAA 

Sequence 1707 

GCGTCCGGCCTCCAGCAAAGCCCATTCAGTCAGCTCTGCAGGCTCATCTTACAAGAATAA 

TCCCTTTGCCAGCTCAATCTCCAAACATGGGGTTTCTTCTGGCAGCTCTTCCTCGGGAGG 

AACACCAGTCCAGAGTTCTGTTTCTGGGAGCCTGGTCCCTGGCATACAGCCTCCCTCCGT 

GGGACAGGCCACCAGCCGACCCGTCCCAAGTTCAGCAGGGAAAAAAATGCCTGTTTCCCA 

GAAGTTGACTCTGGTAGCCCCTCCAGGCGGTCCAAACGGAGATTCCAGTGGTGGGACCCA 

GGGGAGTGGCAAAGTTGCTGACCTCGCCGTCCCTAAAGCCCTCTGCAGTTAGTAGTGTGA 

CATCGTCTACCTCCTTGTCAAAAGGAGGCGAGTGGGGACTGTGCTGCTGGCCGGCTCCTC 

TTTGATGGCTTTACCCTACAAATCCAGCAGCCCAAAGCTGTCTGGGGCCATGAGCTCGAA 

CTTCTTG G G AAATT AT AC 

Sequence 1708 

CACGCGTCCGGGAAGCGGTGCGCTCCGTCACCACGGGAGTGCGGGAATCCGCCGTTTGCG 
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CTGAGGCAATGGCGGCAGCTGCGCCGGTGGCCGCGGACGACGATGAGCGGCGGCGGCGGC 

CGGGGGCTGCACTGGAGGACTCCCGGTCCCAGGAAGGGGCAAATGGTGAGGCCGAGTCAG 

GTGAGCTCAGCCGGCTTCGGGCTGAGCTGGCAGGCGCCCTGGCAGAAATGGAAACCATGA 

AGGCTGTGGCAGAGGTGAGCGAGAGCACGAAGGCCGAGGCTGTGGCTGCGGTGCAGCGGC 

AGTGCCAAGAGGAGGTGGCCTCGCTGCAGGCCATCCTGAAAGACTCCATCAGCAGCTATG 

AAGCCCAGATCACCGCCCTGAAGCAGGAGCGACAGCAGCAGCAGCAGGGACTGTGAGGAG 

AAGGAGCGGGAGCTGGGCCGCCTGAAGCAGCTGCTGTCCCGGGCCTACCCCCTGGA 

Sequence 1709 

CACGCGTCCGCGGACGCGTGGGTCCGCGGCGGCGTCCGGGGTCTCCAGTAGGGCTGACGC 

TCCGGTGCTCGCACAATCCCCCGCCTCGGCTGGCAACGGGCGTCCCTCCACTCCCCGAGT 

CCCCGGCAGCCGCCGCCACCCCAGCGCGCCCCGATCTGGCCCCCTGCCCCGCGAAGATGG 

CTGCCGTACGCCGGGCCCGCAGTTATTGCCGCTGCCTGGTGCGCTTCTCCGACCGAGAAC 

TCTGCTAAGCTCCGCTGCAGAGACAGGCAGGAGTAGACACCCGGACACCCAGCACCCCTC 

CTTCGGGGGGCGGTGCAGAGGGGGCACGGAGAGCCCCTCGAGCGCAGCAGGCCGCCCCGC 

CAGCATGGCAGAAGCTGAGGAAGATTGTCATTCTGATACTGTCAGAGCAGATGATGATGA 

AGAAAATGAA 

Sequence 1710 

ACGCGTCCGGCGAGTGGCCTTCCCGGTTGGCGCGCGCCCGGGGCGGCGGCGCTGGAGGAG 

CTCGAGACGGAGCCTAGTTATGTCTGGGAGGCGAACGCGGTCCGGAGGAGCCGCTCAGCG 

CTCCGGGCCAAGGGCCCCATCTCCTACTAAGCCTCTGCGGAGGTCCCAGCGGAAATCAGG 

CTCTGAACTCCCGAGCATCCTCCCTGAAATCTGGCCGAAGACACCCAGTGCGGCTGCAGT 

CAGAAAGCCCATCGTCTTAAAGAGGATCGTGGCCCATGCTGTAGAGGTCCCAGCTGTCCA 

ATCACCTCGCAGGAGCCCTAGGATTTCCTTTTTCTTGGAGAAAGAAAACGAGCCCCCTGG 

CAGGGAGCTTACTAAGGAGGACCTTTTCAAGACACACAGCGTCCCTGCCACCCCCACCAG 

CACTCCTGTGCCGAACCCTGAGGCCGAGTCCAGCTCCAAGGAAGGAGAGCTGGACGCCAG 

AGACTTGGAAATGTCTAAGAAAGTCAGGCGTTCCTACAGCCGGCTGGAG 

Sequence 1711 

CNCGCGTCCGAAGGCACAGGCGTCTTGCTCTGTTGAAGCAAGTCAGTATCCGAGAAAACT 

GCTGTTCCCTTTGTTGTGATGAGGTAGCAGACACACAATTGAAGCCATGTGGACACAGTG 

ACCTGTGCATGGATTGTGCCTTGCAGCTGGAGACCTGCCCATTGTGTCGTAAAGAAATAG 

TATCTAGAATCAGACAGATTTCTCATATTTCATGACACATGTGAAGAGGCATCGTGGACT 

TTTTTCTACTCAATTCCAGCCAATGTTGAAAAGAAAAAGAAAAAAAAAACTCTAATCAGT 

TGTACACACATTGAMCTTATAGCCATGGCCAGATTTTATGCTAAAAATGGTAGTTTGTC 

MAGACAAAATTCTCTTAGAATCTMTCCAACTTGCCAGCCCTGAGAAAATCCCTTTTAA 

GGCCAAGGGAAAGCTGAATGCTAGCAGCCAGGCCTGTGGTACTTCCATGAGAAACCATAG 

CAGGACAATGCCCTC 

Sequence 1712 

CCACCGTCCGGGCGGCCAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAA 

GATGGCAGAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGC 

GCGGTCGGCGGTTTCCAGCCAATGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCT 

GGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTC I fill I CCTGTCTATGCTCTGTCTC 

ACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCMTATGAGTTTCGATT 

G1TMCCGTGTTTTTGTTTGGGAAGTAATTTTGTTTGAAAATGCTCTCACATACAGGAAT 

TAGGGCCTAGATTGTAAGCTCTTGCAGCAGTCACATTTGTTCCCGGGCTTTGGTGGTTAT 

TTTCTAAATTTTTGAGGTGCCTTTGCTATTTCTTGTGTGACCTGATAGCTTCCCCTC 

Sequence 1713 

GCGTCCGAGCCTCTGGGGGTGGATCCTGAAAGGTGGTCCAGCCGCCTGGCCCTGCGTGGG 

ACCCTCCACCTGGCAGCAGGGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGAT 

CTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCG 

AGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCC 

CCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGC 
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CCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAG 

GAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTNCATCGAGAACTCCGAAGA 

GGCCCTCACAGTCCATGCCCCTTTTCCCTGCAGCCCACCCTGC7TCCCGATCCTTCCCTG 

Sequence 1714 

GTCGCCACGCGTCCGCAGAAGATTGACAAATCTGAGGGCCGCTTCCATGTCCAGAACCTT 

AGCCAGGTGGAGCAGGATGGGCGGACGGGGCATGGACTCCGCAGATCTTCCAAGTTCTGC 

TTGAAGGAGCACAAAGCCCTCAAGACGTTAGGCATCATCATGGGCACTTTCACCCTCTGC 

TGGCTGCCCTTCTTCATCGTTAACATTGTGCATGTGATCCAGGATAACCTCATCCGTAAG 

GMGTTTACATCCTCCTAAATTGGATAGGCTATGTCAATTCTGGTTTCAATCCCCTTATC 

TACTGCCGGAGCCCAGATTTCAGGATTGCCTTCCAGGAGCTTCTGTGCCTGCGCAGGTCT 

TCTTTGAAGGCCTATGGCAATGGCTACTCCAGCAACGGCAACACAGGGGAGCAGAGTGGA 

TATCACGTGGAACAGGAGAAAGAAAATAAAC 

Sequence 1715 

CCCCCGTCCGCTTTTGTTNATCTAAAGGCTTNAGTCCCAI MM! I ATACGTTGTATTTT 

AAMACGTTTGAMGGAGTCTTACACCTGTATCATGAAAACTGAATCCTTTTGAAATACC 

ACTATATGAAGAGAGAGATGAAATTTAGTGAACAGAATTGGAAAAGGTGCTCATAATTTC 

ACTA TGCAA ACTTACCCCAGTCTCTMAAMGTAATTTAGATTTAAAGTTCTTTGATGTA 

TTTGATTTTCTAAATCTTTATGGTTATGATTTGGAATAAAATGTGCCTAATCCTGTGTTA 

CATTCTGTTCTTAMTCTGMTGCCTTCTCATTTAATTCTGAGGAAATATCACACAAGTG 

TCTTCAT TGAC CTTGAAGAMTGTATATACAGTTGCCTTATAAAACAACATAAATTTAGA 

CCATMCTTTTATAGAGAAAGGGTTTTGTCAMTGTTTTCTGAAAATCTGAGTAATTCAA 

AGCATGCCTCTGCCCCCTTTAATA 

Sequence 1716 

NGTCGCCACGCGTCCGGCGCTCTCGGCCGCCGCCGCCTCTGCGTGGGCCGGCCGGGAGGG 

CCTCGGGGGACTGACTGACAGAGTTTCACTCCTGTTACCCAGGCTGGAGGACAATGATGT 

GATCTCGGGTCACCACAACCTCCGCCTCCCGGATTCAAGCGATTCTCATGCCTCAGCCTC 

CCGAGTAGCTCAGATTACAGGCATGTGCCACCACGCCCGGCTAATTTTGTATTTTCAGTC 

GAGACGGGGTTTCCCCATGTTGGTCAGGCTAGTCTTGAATTCCCGACCTCAGGTGATCTG 

TTCGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGGCGTGAACCACTGCACCCGGCGAGG 

CATTTTTTACTGTCTACAG AAACTTATTGTAATTCA I I I I I CCTCACTCCAAGTAGTAAG 

AATTATACCAAATTGAAAAGATATGAATGAGTATCCTAAAAAAAGAAAAAGGGA 

Sequence 1717 

CCGAGGCNCTGATAAGCCNTGGTAACGGGAAACACAGCTCTAACCTCACCTCATTCTCCA 

GGTTACAAAGGCCATGTGCCCCTTTGAATCTGGCAGAGAAAGTTTCCTCGTTGTAAGTAT 

TTGCATCTACTTCAAGCCAGATTCTTCTGCCTCTTTCTCCTTTCCAGACCCCTACTCTGT 

GCAGTGCTGACCACAGCTAGAGCCACCGCCCCATTGCTCAACCAGTATTTATTTCCCTAA 

ACGACCCTTCCTCATATTCCCTTCCCTCCACCTCTCCTTACCAAGCACCCAAAAGAGGAT 

TTAGMCTAGCAGGGTGGACATCATTCTGGTTGTTTCTACTTTTCTCTGCCTAGCACAAA 

ATTGGGAGAAAACTGGAGCCTCCATCCGCAGTCACACGTGTACAGATCTGGGNGATTTGG 

ATGTAGGCTTTTTCTAACTTCTCTCTCAGAAGCTTCTACA 

Sequence 1718 

CGGACGCGTGGGTGCCGCCGCCGCCGTCGCTGTCGTAGTCGCCGCCGCCGCTGCCGGAGA 

AAGAGCACGAGCGGGGAAGCCCCAGAGTGAAATCTAGCATCCTGCCGGCTGGTCTGCCCG 

CCCCTCCTTCCTTTTCCCCCCGGCCCCNGTCCCCTCCCNCCGCAGGTGCCATCCGTCGCC 

ATNCGCCTCTCTACCCTCNCATCCCCAGGTGAGGGGGGTGAGTTCAGGAAGCGGNNACCC 

CNAGGAACCCANCAGGGTCACCATTTGCAGCGCAACATGGCAGGAGCTGGAGGAGGGAAT 

GATATTCAGTGGCGTTTTTTTCAGGTGAAAGGAGCAGTATGATGATGATGTAGCAGNAAG 

CANGATATNATTTCTACAGTANAATTTAATCNTTTCTGGGAGAAATTCTAGCAACANGAA 

GATAAAAGGTGGGTAGAGGTTGTCATTCTTTCAACAGGGAGCAGGGAGAAC 

Sequence 1719 
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TCCGGCCGGCCAGCGTCATCACCATCGTCAAGTCCACCCCGGGCTCGGGGTCTGGCCCCG 

CCCACGGNCCGGACCCCGNCCACGGCCCGGCACNGCNGCTCCTAGTGCCCGCCCCCGCCA 

TGCCTCANCCACGCCCCACCGGCCATAGNTNCCCTCCCCCAAAGNGCNCTGAGGCGCATG 

NCTTCNTGNGACNCAANGNTCCCATTNTCTATTGCAGGCAACATGGCCATTCCCCGAAAC 

TAAAAAGCAGTTGGGGNGGGNGAAAGTACNANGTGGAAAACCCAGNNATCACCGGNANTG 

GNGGGAAAACAANGGCCNGNAGGGACACATTTCCAACTTTAAGCTGGGCNAGTGNNTGGG 

GAACCAAAAAACCTGGGGNGGNCCCNTCANTGGCANGGCCCNTGAGNNCNCAGCNCATGN 

GCATTTNCAGGGGNNGAACCAGNGGGACAAGGGGGACCTCAANAAAAATGrfTGGNTGGGG 

ANGGGCCCANTCNCTGGGCNNGGGGGAAGAACCCGNATTCTACNAGCAAAAGGGGANGGG 

GGGGACNCAGCAAAGGCAATCANGGGAAGAAGGAAGATAAGGGTCNACCANCGGGAATNG 

GCAAAGAAANGGGGGAA 

Sequence 1720 

CTGANGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGNGAAACACAGGAGAGCG 

AGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTACGGGAAAGTCCAGA 

TCTACACAGCGGATATTTCAGAANTCCCTNAGTGCAACTGCAAGCCCACAGATGAGAATC 

CTTGTGGCTTTGATTCGGAGTGTCTGAACAGGATGCTGATGTTTGAGTGCCACCCGCANG 

TGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGCCAGTNCCCAGAGA 

CCAAGATCATCAAGACAGATGGCAAAGGGTGGGGC 

Sequence 1721 

CATGGCTTCTGCGAGAAMGTGATTTAGGCAGACGGAGGTTTTTTCTCAATCAGAGGCTT 
TCAGTAACTCTGCTGATGCACAGAGAAGAGACTTCCTCAGCCTGCAGGCTACAAGAGCCA 
ACTGTTAGTGCAAAAAAGGACTTTAATACAAATTTCTTATTCCAGAMTTTTGTTCCAGG 
TCTGGACAAGCTGAGAAATTTATCATTGTT^ 

GAGAGGTATGGGTGTGTGTGCAAGGCACACACATACAGTCTTTCTGTACATGCATGCATA 

TTTATGCATGTACAGGGMGTATCCAGACACCMATTTTTAATAAAATGAATTCCCCAAA 

GGGGAGTCTTGACCTGMTTMGGCTGTTGTTTATAGGGAAGCCAGATATAATTGATGNT 

GAAAAANAAACTAATTTTTATACTTAATCACCGGCAGNTANCGGGGGCANGGGGGAAAAA 

GTACAGANGGGGTGT A I I I I I I GG Il I H I I CT 

Sequence 1722 

TCGCCACGCGTCCGCTCTTAACACAGAGTCTGCAGCCCCTAACTGACACCCTGTCCTTCC 

TCCTAGGAAGTGCTGGACTCCCTGGTCAGCAATGTCAACATTGAGCTGCTCAATGCCCTC 

CGCTACCATATGGTGGGCAGGCGAGTCCTGACTGATGAGCTGAAACACGGCATGACCCTC 

ACCTCTATGTACCAGAATTCCAACATCCAGATCCACCACTATCCTAATGGGATTGTAACT 

GTGAACTGTGCCCGGCTGCTGAAAGCCGACCACCATGCAACCAA 

Sequence 1723 

ATCCGGTTCGCCCCNCGTCCGGGCGGCCGAGGCGGAGGCAGCGGCGGCGGGATGGCGGAC 

GCCAACAAGGCCGAGGTGCCCGGGGCCACTGGTGGCGACAGCCCGCACCTGCAGCCCGCA 

GAGCCGCCGGGCGAGCCGCGGCGAGAGCCGCACCCCGCGGAGGCGGAGAAGCAGCAGCCG 

CAGCACAGCAGCAGCTCCAATGGCGTTAAAATGGAGAATGATGAATCAGCAAAAGAAGAG 

AAATCTGACTTAAAGGAAAAATCTACAGGAAGTAAGAAGGCCAATAGATTTCATCCTTAT 

TCAAAAGACAAGAAT 

Sequence 1724 

GTCGCCNCGCGTCCGTGC I ! I I I I I CGACATACTGGTTTTTCTTTCTGTTTTTCTTCTCT 

TTCTTCTA TTTCT TGTGGATATTATGGCTAATAACACAACAAGTTTAGGGAGTCCATGGC 

CAGAAAACTTTTGGGAGGACCTTATCATGTCCTTCACTGTATCCATGGCAATCGGGCTGG 

TACTTGGAGGATTTATTTGGGCTGTGTTCATTTGTCTGTCTCGAAGAAGAAGAGCCAGTG 

CTCCCATCTCACAGTGGAGTTCAAGCAGGAGATCTAGGTCTTCTTACACCCACGGCCTCA 

ACAGAACTGGATTTTACCGCCACAGTGGCTGTGAACGTCGAAGCAACCTCAGCCTGGCCA 

GTCTCACCTTCCAGCGACAAGCTTCCCTGGAACAAGCAAATTCCTTTCCAAGAAAATCAA 

GTTTCAGAGCTTCTACTTTCCATCCC 

Sequence 1725 
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MTNTGCCAGCCTTTATTGAGCTTTACMCTAACAACCTGATAGTTGGCAGTTAATTCAC 

AGTTACAGATAATGCTTTTATTTACATAAATATACCAAGTAGTACCCTCTTATTGTATTC 

ACTTCATCTATTTTCTTAGMTACTTGCMTTTCTMTGACCCCTTCCCTTTCCCTCCTG 

CTGCCCTGTCCACCCTCTTTCCCCTTCTAACATCCTTAGAGGGATGAAATCTCAGCATAT 

GTTGCAGGACACCAAAAGGAAGAAAACAATCAAGCAAATAAAATAAACAGTCAAACAAAC 

CAGGAGTTTAAAACAACAACCCCAACAACAGAAGCCTTGGCAAAGAGGAATGAGTGATCA 

GCMGTGMCACACTCTATGTCMCTCTCCTTTTTATCCAGCTGAGATTTTATGGTAACC 

TTTAATTTAA 

Sequence 1726 

CCNCGCGTCCGGAGCCGAGAGTGTGTGGAGCAGTTACAGCTGGAAGACCGGGTCCTCTGC 

CTCCACAGTAGATGGCTGAATCCTCTATGCGGGACTGGCAAATGGCACTGTGGTCACCTT 

CAACATAAAGAACAACAAACGACTTGAGATCTTTGAATGCCATGGCCCTCGGGCAGTCAG 

CTGTCTTGCTACAGCTCAGGAAGGTGCCCGAAAACTGCTGGTCGTGGGGTCTTATGACTG 

CACAATTAGTGTACGCGATGCCCGGAATGGACTGCTCCTCAGAACTCTGGAGGGCXJATAG 

CAAAACCATTCTTTGCATGAAGGTGGTGAATGATCTCGTGTTCAGTGGCTCCAGTGATCA 

GTCAGTCCATGCTCACAACATTCACACTGGTGAGCTCGTGCGGATCTATAAAGGTCACAA 

TCATGCAGTGACTGGTGGTGAATATCCTAGGAAAAGTGATGGTGACTGCTTGCCTGG 

Sequence 1727 

CNCGCGTCCGGATNAATATTTTCATCCCTGAGGTTAACMTTACCATCAAAATGTTTTGT 

GGAGACTATGTGCAAGGAACCATCTTCCCAGCTCCCAATTTCAATCCCATAATGGATGCC 

CAAATGCTAGGAGGAGCACTCCAAGGATTTGACTGTGACAAAGACATGCTGATCAACATT 

CTGACTCAGCGCTGCAATGCACAAAGGATGATGATTGCAGAGGCATACCAGAGCATGTAT 

GGCCGGGACCTGATTGGGGATATGAGGGAGCAGCTTTCGGATCACTTCAAAGATGTGATG 

GCTGGCCTCATGTACCCACCACCACTGTATGATGCTCATGAGCTCTGGCATGCCATGAAG 

GGAGTAGGCACTGATGAGAATTGCCTCATTGAAATACTAGCTTCAAGAACAAATGGAGAA 

ATTTTCCAG 

Sequence 1728 

TCGACCACGCGTCCGATCCTGGATCTGGAGAGAGAGCTCTCCAAGCAAATCAACGTGTGC 

CTCTGAGCCAGATGACGGGGTGGGACCCCGGTTAGTAAGGACCGGGCGCCCAGTGGCTAA 

GGCGGTGCCCTGGTGACCAAGGAGAGCCAGACCTGTTGCTCAGGCCGAGCTCCTGGTTGC 

CAGCGAGTTACCACGGGACCAGTCGCGTGTATGGCTGAGACTCATTCCCAGTTTCCAGGG 

CCCGGTATTTGGACACTAGTTGCCMGTCTGGGGCCTGGGGATTTTAGGGACCAGCGGTT 

GTGACCATCTTTCCTGAGCACCMGGGCTTCCCCTTTTGTTGCCAAAAAGGTAGTTCTCG 

CGCTTGCTAGGCTGGCCTCTCTTGCCTCCCCTTGGCCGGGGC 

Sequence 1729 

TCCGAMCACCTGTCATTTTACACAMTGCGTTTTGAATGTCTGAAAGACAGCTCCTGCC 

CTTMTTTAGATGTAAACCATTTAGTTTCAAACTAACCACCTGATAAAATCTATCAACAT 

TTTATCATGAACTAGAGCAGATGTCTGTTATTTGATGTCTATGTTATTTGAGTTTACTGT 

TTMTMGTGMTTCATATCMTTAATCCTGCTAACAAATTTGACACTTAAGGTGATTCT 

GMMTCCTTTAMCTTAAAGTAGATGGMTCTTAAGTATGGGGCCTTTTAGTGTCCGTA 

AAGAAAAACTGCATGCAACAAAATATAGCAGGTCCTCACTTGTTGAGATTCATGGAAATT 

GTGACTTTAAATGAAATGACATGGCTGGGCATGGTGGCTCACACCTGTAATCAGCACTTT 

GGGAGGCCACGGCGGGTGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 

T 

Sequence 1730 

CTGNGAGTTAAATTGGTCCAGAAACAGTTATGACCCTCTTTATACTGCCAAGAAATACGC 

AGTCCCAGCCTTGGAAGCACACTGTGTAGAATTTCTCACCAAACATCTTAGGGCAGATAA 

TGCCTTTATGTTACTTACTCAGGCTCGATTATTTGATGAACCTCAGCTTGCTAGTCTTTG 

TCTAGATACAATAGACAAAAGCACAATGGATGCAATAAGTGCAGAAGGGTTTACTGATAT 

TG ATATA GATACACTCTGTGCAGTTTTAGAGAGAGACACACTCAGTATTCGAGAAAGTCG 

ACTTTTTGGAGCTGTTGTACGCTGGGCAGAAGCAGAATGTCAGAGACAACAATTACCTGT 
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GACTTTTGGGMTAMCAAAAAGTTCTAGGAAMGCACTTTCCTTAATCCCGGTTCCCAC 

TGATGACAATTGAGG 

Sequence 1731 

ACCCCGCGTCCGAGCAGCCTCCAGTTGCCCCTACTTAGTGGCTTGCCCTCTGCCTGCCTC 

AGCTGCTGCCTGACCGGCTGGGGGAGGCACTGGCGGGAGGCCTCGGGCTCCCCTGGAAGG 

GCGCTGGGCTGGCGGGTCAGCTGGTGGTTCTTAGGTTTCCTTCTGTTTGTTAAAAGGGAC 

AATGTGGCCACTTCTCTGTGGAAAGGGAGTTGGTTGGGGGG7TGAGATGGCCCGTGTTCA 

TMCTCAGTTTCCTGTTTTGCACGATGTAAAAACCCTGTCTTTTTGCACGATACAGCCAA 

MGTATTGGCTGAT7TCTTGCTGAGTGCCCTCTTAGTTGGTGTGTGAGGTCTTGGTGGGC 

TCAGGCCAGCTGTTTGCGAGTGTGGGMCTCATAGGTTCTGTCTTTGTCTCTTCCTTTCA 

CCTCATTCTGGTAGCAGCATAAAGGTTAGGCAATCACTGGGACCC 

Sequence 1732 

GCAAATCATACATTGCATTCCCCAAAGCATCTGAACGTACTTCTAGAAAACAAACCAACC 

AAAAGGGAAMTATGCATGCTTTTTGTAATTAACTGGTGCTTTGAAAATC I MM f A AGG 

GAGMAAATCTCMCCAMGTTATGCTCATCCAGACMGCTGACCTTTGAGTTAATTTCA 

GCACAACTCATTCTTCAGTGCCTCATGACTGAAAACAAAAAACAAAAAAAGAAAGCATCT 

TCNCMTGMGCTTCCANATAGCACCGTTTTGCTAAMGATACATTCTCATTGTTTTCCA 

ACAGNGATGGCTTCCACATAANGGTTAAACAAACTGGGNGCTTGGAAATAATTTNTNACN 

GGTTACTTNTTCGCATTTTTTNGAACNAAGGAAANGGGATTCCC I M M 1 1 AGGGGGGAA 

GAATTGNGGNCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1733 

CGCGTCCGAATTTAGGAAGACCCCGGCGACCTGTTCCTCACCCCCGCTTCGCCCTCACAC 

TTTCGGGATGTCTGCGATTCCTGCTGAGGAGAGCGACCAGCTGCTGATCCGACCCCTTGG 

AGCTGGGCAAGAAGTAGGAAGATCATGTATTATTCTCGAGTTCAAAGGAAGAAAAATAAT 

GCTCGACTGTGGGATCCACCCTGGCCTAGMGGAATGGATGCTCTTCCTTATATTGATTT 

MTTGGACCCACCTTGGAATTGNACCTCCNANTAANTNANNNCATTTTCATTTTGGAANC 

^GGGGGGGCNTNNCCCCNNGGTTTTTCNNAAAMNNCCCCTTNAAANGNNAAAMATTT 

TTTTNTTTTTTNTCCCANAAAAAA I I I M M I NAAAANNCCCNCTNNTNTNAGGGGNTNT 

TNAAAAAATTNCCCAAAAAAANNNNCCCCCCT1TNTNTNTTTTNAAAAAAAAAAAAAAAA 

AAAANATNGNQCCCNNAA I M M M M 1 1 ATATTTTTNANNAMANAAANNTNTTTTTTN 

AAAAAANTNTAT N I M M M I CNNTNNANNNNGTGNNNNCCCNNN I iTTTTTTTm 

Sequence 1734 

CCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGGGCCGCCCGACC 

CCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGTCGATTCTGCTGCTTTTCCTGACG 

GAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATAACGCGGAGATCTGTCTC 

CTGCCCCTAGACTACGGACCCTGCCGGGCCCTACTTCTCCGTTACTACTACGACAGGTAC 

ACGCAGAGCTGCCGCCAGTTCCTGTACGGGGGCTGCGAGGGCMCGCCMCMTTTTTAC 

ACCTTGGGMGGTTTGCNACNATTCTTTGTTTGGANGANTAAMAAAGGTTCCCAAAATT 

TTCCCNCTTTNAAAAGANNNTNGNGACCNNNCACTNGGGGGGGGGCCCCAAAAAAAAATT 

TTTTTTTNTTAANNCCCCCCCGGGGNNGNAAAAAN I WWW I GGGGGGGGNCCCCCCCC 

NNGGGNNAAAANNNTTTTANAAAAAA'I Ml IIIIIIINIII I I CCCC AAAAAAAAATTTT 

T 

Sequence 1735 

GCGTCCGAAAATACAATACACGGACTCTTTCGAGGCCCTGTAATTGGAATGAGTCCACTT 
TAAATCCTTTAACGAGGATCCATTGGAGGGCACTTCCAGAATACCTCCTCCCCCAGCGCC 
CGCTGCCAGCCCCACACCAGGTGTGAGAACCAAGGTCTGGTGGAGGCAGCTCCAGGCACT 
GCCCAGTCCGACACAACCTGCAAAAATCCATTAGAGCCACTGCCCCCAGAGATG 
Sequence 1736 

CGAACCTCCTGGTTCCAAGTGGGAGACATGGTGCTGTCGGAGCTAGCGGCGCGCCTCAAC 
TGCGCCGAGTACAAGAACTGGGTGAAGGCGGGCCACTGCCTGCTACTGCTTGCGCAGCTG 
NCTGCAGGGTTTCGTCGNCCGCNAGGTGCTCTCCTTTCCACCCGCGGCCTACTCGCCGCA 
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GCCCCCGGC 
Sequence 1737 

ATCCTTTTGCCTAAAGATGTAAACAAAACTCAAGACAGAAGGAATCAGGGAATATGTGCT 

NTTGTGNGCATCTTGTTTACATTTNGGGATCAANTGATGGCAAAAGAAGTAATGAGACCA 

CTNNAMTTGTTTTNCANTTGNNTTTAAAANACCAGGGTTCCTCATTTTCTTTGA 

NMGTTTTMCAATTGACCTTCTTAAGNGACATTCTCTTTCAAAAAAGANANGTAAANCA 

GGNGAAATGAAGGGTGGNTGGGGAAA 

Sequence 1738 

CCGGCTCATTCCCTGAGGCCGGCCCGCGTCCCGTCAGGGCGCCGCGCGGGGTTAGCGCGG 

GGTCAGCGGAGGTCAGCGGGGGTCAGCAGCAGCGGCTCCGAGGGCGCGGCGGACGCAGGA 

TGTACACGCTGCTGTCGGGC7TGTACAAGTACATGTTTCAGAAGGACGAGTACTGCATCC 

TGATCCTGGGCCTGGACAATGCTGGGAAGACGACCTTCCTGGAGCAGTCGAAAACCCGAT 

TT 

Sequence 1739 

GTCCGNCCACGCGTCCGATTTCTTGTGTGCTTTGAAAAMGTTTCAGCTTGCTGTCTCTT 
TTAGTGTTTTAAAGMGTGTTATACAAAGCATTGTTTGCAAAATATAGGGAGGATAATGG 
GAGTCCCACTTTMNTTNGGGMNTCNTTGGGNGANCTTNTTNATCCAAGGTTTANTCAA 
GCCTTCNTTTTCCMCTTTTAAAAMTTTTTTGT^ 

AAMATTTTTAAAMTATTTTANTGTTCTTGCNACCAAATTTGTTCTTNAAAAM 
AAMCTTGNTGGCAAA7TCNTTTGGTCNNTTTAAAAAAAAAAAAAA 
Sequence 1740 

ACGCGTCCGCAGCCATCTTGGGATCTGGGCAAGTGAGCGAGCTCCTTCCTCACCGGGCTG 

ACTAGCCTCTCCTTTCCCTGTCCCCCTCCATCGCTGCTCTGCAGGAAGCCAGCCCCCAGG 

GCCAGTCCCGGAGGGGCTGATCCGCATCTACAGCATGAGGTTCTGCCCCTATTCTCACAG 

GACCCGCCTCGTCCTGAAGGCCAAAGACATCAGACATGAAGTGGTCAACATTAACCTGAG 

AAACAAGCCTGAATGGTACTATACAAAGCACCCTTTTGGCCACATTCCTGTCCTGGAGAC 

CAGCCAATGTCAACTGATCTATGAATCTGTTATTGCTTGTGAGTACCTGGATGATGCTTA 

TCCAGGMGGAAGCTGTTTCCATATGACCCTTATGAACGAGCTCGCCAAAAGATGTTATT 

GGAGCT 

Sequence 1741 . 

CANCGCCCCGGCCCGNTCAAGCAGCNTAATAAAGCTCATANAGGCGGACNGGCATCNGGG 

TCTNGGGATCTGCACAGCGGGACTGGCATAGGGCCGTCTGGCACTGAAAACCCTAANCAA 

GAAGGTGTGAAAAGAAACTTNAGCNGANNCGACCNGAGGCATCNNCGCCAGCCAGCTCCG 

AAAGCAGAAGANGGAGGCGGTTCTGGCATGAGAAAGANACAGCTGGGCTGGCNAGGATAG 

GCCCCTCCTTCATCAAGGTACTGGTGGTGCCCCTGCACAGNANAATTNTNCCTGCCAGAG 

GCCATGCANCCTGCTTCAAGATAGGGACACTGGAACAGATACACTTGAATGAATTGGGGA 

AACACCCCAGAACTTTT 

Sequence 1742 

CACGCGTCCGTTTTTTNCMGGGTCTATTGTTTCGATTAGTTTCCTTGCAGGAGGTAGAA 

GGmCCTCCATCCCGAGTMTTCATAGAGTGTTTGGCTTGNCTACCTTCCTCCTGACTG 

MGTCACCTGATACTTTTTGTTTTTTCMGMGGAAGAGAGAACCCTGTTGCCTCAGTTA 

CTAGCAATG ATACAATTCTCAAAATCTGGTC I I I I I I I GTTTCTTTGAAATAGTTTCTCC 

ATGTTGTGTGACACAGCAGCCCCTGTCTTATCATAGTTGTCTTCCCTCCACCACCTGTAC 

CAGAGATGTTGGATATGTTGGAGGTGMGGTGTGCAAGGTTTTTAACTAACTGTTCTAAT 

T AMGGATT CTGCAGGAMGMCATGGGTTTACAAMGAGAAGCTTTTGTATTATTAGTA 

A I I 1 1 I I I ICTTTGATGAATTTATGTGCTTAGTTTGGAGAATCGAGAGTTGGCTGGGAAA 

AGATTTCTGAGGAGTTAAGGGACTCTGGTGCTGTTTGGGAA 

Sequence 1743 

GTGATTAGAAGTAAGCGNTGATGAGGCTGAAGAAAAGGAAGACAAAACTGAGTACTTGGA 

GGAACGAAGAGTAATGGGATATCCAATAAACTGAGAATGTTTCTTCACAATCTCCTTTAT 

TCTTCGTTCCTCCAAGTACTCAGTTTGGTCTTCTTTCAGGTGTAGGAACGTGATAAAGAA 
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GTAAGCGATGATGAGGCTGAAGAAAAGGAAGACAAAGAAGAAGAAAAAAAAAAAAAAAGG 
Sequence 1744 

CGATTTCTTTGTTGGACAACCCAGCTGGGGCTAGGAATGGTTCAGAAGGTTTAAGGCCGG 

MNGGGNMTGMGGGGCCCGGCGCTMCCCTCTAGGGACCTGTTTTGCTTCTGTTTAAA 

CCAAATGGGCAGTCTGTCATTACACACACCCTGNGTCTTCATATGTGGCTCGCCAGTATA 

ATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCA 

ATGGCATTCCGGGTACACCTGGGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGG 

AATGTCTGAGGGAAAGCTTTNAGGAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGA 

GTTCATTGAATTATGGCATANATCTTGGGAAAANTGCGGAGTGTACATTTACAAANATGC 

TTTTCAAATAGTTGCTNTAANANTTTTGTTCAG 

Sequence 1745 

GGACGCGTGGGTGGAAATGTAMCMGAATAGACTGTTCATTCCTGATGGCTTTTAGTCT 

ATACTAACATATTGTTTGTCATGGCATCCGAGACTGAAAAGACCCATGCTTTACTGCAGA 

CTTGTAGCACTGMTCTCTTATTTCCAGCCTTGGGTCTGGGGGCATTTTGCCTCGTAGCT 

GACAGACTTCTTCAGTTTTCCACAATTCAGCAAAATGACTGGCTTCGTGCTCTCTCAGAT 

AATGCAGTACATTGTGTAATTGGCATGTGGTCATGGGCGGTAGTCACTGGAATCAAGAAG 

MG ACTGAC TTTGGAGAAATCATTTTAGCTGGAT7TTTAGCCTCTGTTATTGATGTAGAC 

CACTTTTTTCTAGCTGGATCCATGTCTTTAAAGGCTGCTTTGACTCTCCCGCGAAGACCT 

T TCCT TCACTGTTCTACTGTGATTCCCGTTGTGGTTCTGACCCTGAAATTTACTATGCAC 

CTTTTCAAGCTCAAAGACTCATGGTGCTTTCTTCTGGGATGGTATTTATATCC 

Sequence 1746 

GTGMCAGG^^^ATTGACTATGGTAAACTTATTTTATTGAAGTTTTCAACCGGAAAATAGT 

MG TGGAA TATGATACAACTGTTATTTTCAGAACATATTTCTTTAGGGCTATTTAAAATA 

ACCTTTTTAAAGGGCAAAAACTTTCMTTTGAGAGAACAAATTCCTCTCCTCTGTGGGAA 

ATATTGGCTGAGATTTTGTATAGAATAAGAGACATGTATGTAACATATATTTATATTCAG 

CATMGTCTACTGCMTCATGTACACATCTTAGCMGACGAGAGGATTTTGTTTAGTCTT 

TGTTTATGACTTCTACAGTTTCCTGTATCTAGTGTTAAGTTGTAAGGAAAAACTAAACAT 

GCMTTTA AAGG TAAACTTGATAACTATTTATGGAACATAAGCATACACCAATGGTTATT 

TCTCACAGTTTTCATGCGCATTTGTTTATTGTTTACTTGGATTAGGCTCATTAAAAACCA 

TAATGCTGGTCACAATTAGMTGCTAATATTTGGGGAAGCTATGCAGAAAATATTT 

Sequence 1747 

CNTGTGTGCCATGTATACCTAACGGGAGTCCCAGAAGACAGGAGAGAAAAAAAGAAAGAA 

ATAAAAAGAATATTTGMTTTAAMTTGCTTGAAAATGTCTCAAATTTGATGAAAAATAT 

TACTCTGCACATTCAACCCATGAACTATAAGTTGTATAAAATCAAAAAGTTTCACACCAA 

GGCGTGTCATAGCCAAACTGTCAAAAGCCAAAGACACAGAATCTTGAAAGCAGTGAGAGC 

AAAGCAGACAAGGGATCCCCAATAGGATTAACAGCAGATTTCTCATCAGAAGCCATGCAA 

GCCAGAAGGCTATGGGAGACATACTCAAAATGCTGAAATAAAAGACTGTCAACAAACATT 

TCCACATCCAGCAAAAATCAAAAACGAAGGAGAAATCTGTTGCATGTGAGCTGAATAGAA 

TTTGTATTCTGCTGTTGTTGGATTGAAGTATTCTTTAAATGTCAATTANATCAATTTG 

Sequence 1748 

CGATGGCAATACATGTACTCAGATAGTTACATCCCTATATAAAAAGTATGTTTACATTTA 

AAAMTTAGTAGATMCTTCCTTTCTTTCAAGTGCACMTTTCATTTTGACTTGAGTCAA 

CTTTTGTTTTGGAACAMTTMGTMGGGAGCTGCCCAATCCTGTCTGATATTTCTTGAG 

GCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTMGTGGGATTCTTAAT - 

ATCACCATTCTTGGGACTGGTATACATMGGCAGCCGTGAAACTGGAAAGTCATTTTGAT 

GACTGATGTGATACATCCAGAGGTAAAATGCATTTAAACATATTAAAGTATTTGCCAAAG 

ATACAATTTTCTTGCTGACATAAAAATCACACAAACAAGTCCCCCCCAAACCACAACTGT 

CTCTCAAATAGCTTMAAAAATTGAAAAACATTTTAGGATTTTCAAGTT^ 

AAAAAGATGTTCAAGCTATTAGAGGGAATGGTAAA 

Sequence 1749 
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GCCCNTCCCGACCCTCCAGCTCATGGTGTCTGGGGCCTGCGGCTAGACTCTTGGAACATT 

CTGGAACTCTCTCCTTTCCTGGCTGGGGCTCTGACCACAAACTCCCCTCCAGGCTGCCCC 

TGGGACATGGTGGTGATGTGGGTGCAGGAGCCAGTGTCTGTTGTCGGGACTCGCAAGTGC 

CCTCATCACAGCCACCCCCACCACGAGTGTCTCCCCAGTGCAGACTCAAGTTATGCTTGA 

AATGAAAAAGTCTATCTGGTAGTGGGTAAACGTAGACCTGGCACTGTTCCACGCGGGCGC 

CCCAAGCCTGCCACTCCTGTGTCCCTGCCTCCCTGGGCTCCCGAGATAGGCACCACTGTA 

TCCTCCAGCTCCTTCCTTCCTTCCCCCAGGAACACGGAGGCCACCGAGGGGGCTGGGCCT 

ATCAGGAGGACACAGGCTGCAGCCTGGCACCCACCCCTCCATCTTCACCCCACATGGAAG 

ACTTGTCTTCAAC 

Sequence 1750 

GCGTCCGGATTCTAAAMAAAAAAAAAAAAAAAAAAAAAATTMTTGGGGTGCCTTTTTG 
TTATAGTTTCTATTTTCTGTTTTGTAGGACMGCTGCATTTTCTGTAAATATAGGTCTGG 
ACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTAT 
TTATCTCTAAGTATATTTGAAGTGATTG^ 

TCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAAATAAAGCTCTCAGGGAGACATGA 
ATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCT 
TAATATCC TTAAATTATGTGATATATAAAGAGGACTGTTAC 1 1 1 I I I ACTTTC I MINI 
I I I 1 1 1 1 1 GGCTTTGCTTTATTTATTTGGAGT 
Sequence 1751 

GGGCATGCTCATAGGCACAGCTGTTGGTCAGTATGCCAATAACATCACACTTTGGATCTT 

TGCAGTCACTGCAGGCATGTTCCTCTATGTAGCCTTGGTGGATATGCTTCCAGAAATGTT 

GCATGGTGATGGTGACAATGMGAACATGGCTTTTGTCCTGTGGGGCAATTCATCCTTCA 

GMTTTAGGATTGCTCTTTGGATTTGCCATTATGCTGGTGATTGCCCTCTATGAAGATAA 

MTTGTGTTTGACATCCAGTTTTGACCTTTCCCAGTAATCACTGTTGATTACGAGAATGT 

TACCATGCAGCTTTGCATCTGTTCCTTGTACTGTATGCACATTGCTCAAAGGAAAGTCAG 

TGGCTT GCAC TACTTACMGTTTCATAGATTTGAGCCTAACCACAAGAGGCTGGTGCTTA 

GTACTGTTTTCCCTGCACGTAGGGGTCTTTTAAAAATATAAAGCTTGTGATAAAGAGAGG 

A 

Sequence 1752 

CTGGTTCAGCAGCCGCCCACCCACCTCTGAGTCTGACCTGGAACCTGCCACAGATGGGCC 

AGCCTCCGAGACCACTACCCTCAGCCCAGAGGCCACCACCTTTAATGACACCAGAATCCC 

TGATGCAGCTGGTGGCACGGCCGGCGTGGGTACCATGCTTCTGTCCTTTGGGATCATCAC 

GGTGATAGGCCTGGCTGTGGCCTTGGTTTTGTACATCAGGAAGAAGAAGAGGCTGGAGAA 

GCTACGCCACCAGCTCATGCCCATGTACAACTTCGACCCCACGGAGGAACAAGATGAGTT 

GGAGCAGGAGCTGCTGGAGCATGGGCGGGACGCCGCCTCTGTACAGGCTGCTACTTCTGT 

GCAGGCCATGCAGGGCAAGACTACTCTGCCCTCCCAGGGCCCACTCCAGAGACCCAGCCG 

GCTTGGTGTTTACCCGATGTGGCCAATGCCATCCATGTGTGAGTGGCCTGGGACAAGC 

Sequence 1753 

GTCGCCCCGCGTCCGGTGCTCTCATGTCATCTCAGAGTTCCAGCTTATCAGAGGCATGTA 

GCAGGGAGGCTTATTCCAGCCATAACTGGGCTCTACCTCCAGCCTCCAGAAGTAATCCCC 

AACCTGCATATCCTTGGGCAACCCGAAGAATGAAAGAAGAAGCTATAAAACCCCCTTTGA 

MGCTTTCATGAAGCAGAGGAGGATGGGTCTGMCGACTTTATTCAGAAGATTGCCAATA 

ACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATCTCCCAACCTCAGG 

AGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCTCAGCAAATCAACC 

TTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTCTTGAAAGTGATCG 

GAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCA 

Sequence 1754 

TCGCCCCGCGTCCGGACTGATCATAAACCATGCTGGTATTGCACCTTCTGGAACTATGGG 
CTTGAGAAAACCCCCAGGATCACTTCTCCTTGGCTTCCTTATTTTCTTGAGGCAGGTCGC 
ACGTTCTACCTGCCCAAGACGTGTGATATCAGCTTCTCAGATCCAGACGACCTCCTCAAC 
TTCMGCTGGTCATCTTGTCCTGATGAGGGCTTCTACAAGAGTGGGAAGTTTGTGTTCAG 



